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Description 

BACKGROUND OF THE INVENTION 
5 1 . Reld of the Invention 

[0001] The present Invention relates to novel polynucleotides derived from microorganisms belonging to corynefonm 
bacteria and fragments thereof, polypeptides encoded by the polynucleotides and fragments thereof, polynucleotid 
arrays comprising the polynucleotides and fragments thereof, computer readable recording media in which the nucle- 
10 otide sequences of the polynucleotide and fragments thereof have been recorded, and use of them.as well as a method 
of using the polynucleotide and/or polypeptide sequence information to make comparisons. 

2. Brief Description of the Background Art 

IS [0002] Corynefonn bacteria are used in producing various useful substances, such as amino acids, nucleic acids, 
vitamins, saccharides (for example, ribulose), organic acids (for example, pyruvic acid), and analogues of the above- 
described substances (for example, N-acetylamino acids) and are very useful microorganisms industrially. Many mu- 
tants thereof are known. 

[0003] For example, Corynebacterium glutamicum is a Gram-positive bacterium Identified as a glutamic acid-pro- 
20 ducing bacterium, and many amino acids are produced by mutants thereof. For example, 1,000,000 ton/year of L- 
glutamic acid which is useful as a seasoning for umami (delicious taste), 250,000 ton/year of L-lysine which is a valuabi 
additive for livestock feeds and the like, and several hundred ton/year or more of other amino acids, such as L-arginine, 
L-proline, L-glutamine, L-tryptophan, and the like, have been produced in the world {Nikkei Bio Yearbook 99, published 
by Nikkei BP (1998)). 

25 [0004] The production of amino acids by Corynebacterium glutamicum is mainly carried out by its mutants (metabolic 
mutants) which have a mutated metabolic pathway and regulatory systems. In general, an organism is provided with 
various metabolic regulatory systems so as not to produce more amino acids than it needs. In the biosynthesis of L- 
lysine, for example, a microorganism belonging to the genus Coryr^ebacterium is under such regulation as preventing 
the excessive production by concerted inhibition by lysine and threonine against the activity of a biosynthesis enzyme 

30 common to lysine, threonine and methionine, i.e., an aspartokinase, (J. Biochem., 66: 849-859 (1969)). The biosyn- 
thesis of arginine is controlled by repressing the expression of its biosynthesis gene by arginine so as not to biosyn- 
thesize an excessive amount of arginine {Microbiology 142. 99-108 (1996)). It is considered that these metabolic 
regulatory mechanisms are deregulated in amino acid-producing mutants. Similarly, the metabolic regulation is dereg- 
ulated in mutants producing nucleic acids, vitamins, saccharides, organic acids and analogues of the above-described 

35 substances so as to improve the productivity of the objective product. 

[0005] However, accumulation of basic genetic, biochemical and molecular biological data on coryneform bacteria 
is insufficient in comparison with Escherichia coli, Bacillus subtilis, and the like. Also, few findings have been obtain d 
on mutated genes in amino acid-producing mutants. Thus, there are various mechanisms, which are still unknown, of 
regulating the growth and metabolism of these microorganisms. 

^ [0006] A chromosomal physical map of Coryr^ebacterium glutamicum ATCC 13032 is reported and it is known that 
its genome size is about 3,100 kb (Mo/. Gen, Genet, 252: 255-265 (1996)). Calculating on the basis of the usual gene 
density of bacteria, it is presumed that about 3,000 genes are present in this genome of about 3, 1 00 kb. However, only 
about 100 genes mainly concerning amino acid biosynthesis genes are known in Corynebacterium glutamicum, and 
the nucleotide sequences of most genes have not been clarified hitherto. 

^ [0007] In recent years, the full nucleotide sequence of the genomes of several microorganisms, such as Escherichia 
coil, Mycobacterium tuberculosis, yeast, and the like, have been detemnined {Science, 277. 1453-62 (1997); Nature, 
393: 537-544 (1998); Nature, 367: 5-1 05 (1 997)). Based on the thus determined full nucleotide sequences, assumption 
of gene regions and prediction of their function by comparison with the nucleotide sequences of known genes have 
been carried out. Thus, the functions of a great number of genes have been presumed, without genetic, biochemical 

50 or molecular biological experiments. 

[0008] In recent y ars, moreover, techniques for monitoring xpr ssion levels of a great number of genes simulta- 
neously or detecting mutations, using DN A chips, DNA anrays or th like in which a partial nucleic acid fragment of a 
gene or a partial nucleic acid fragment in genomic DNA other than a gene is fixed t a solid support, have been 
developed. The techniques contribute to the analysis of microorganisms, such as y asts, Mycobacterium tuberculosis, 

S5 Mycobacterium bovis used in BCG vaccines, and the like {Sdence, 278: 680-686 (1997); Proc. Natf. Acad Sa\ USA, 
96: 12833-38 (1999); Science, 284: 1520-23 (1999)). 
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SUMMARY OF THE INVENTION 

[0009] An object of the present invention is to provide a polynucleotide and a polypeptide derived from a microor- 
ganism of coryn fonn bacteria which are industrially useful, sequence infomiation of the polynucleotide and th 
s polypeptide, a method for analyzing the microorganism, an apparatus and a system for use in the analysis, and a 
method for breeding the microorganism. 

[0010] The present invention provides a polynucleotide and an oligonucleotide derived from a microorganism be- 
longing to coryneform bacteria, oligonucleotide arrays to which the polynucleotides and the oligonucleotides are fixed, 
a polypeptide encoded by the polynucleotide, an antibody which recognizes the polypeptide, polypeptide arrays to 
10 which the polypeptides or the antibodies are fixed, a computer readable recording medium in which the nucleotid 
sequences of the polynucleotide and the oligonucleotide and the amino acid sequence of the polypeptide have been 
recorded, and a system based on the computer using the recording medium as well as a method of using the polynu- 
cleotide and/or polypeptide sequence infonmation to make comparisons. 

15 BRIEF DESCRIPTION OF THE DRAWING 

[0011] Rg. 1 is a map showing the positions of typical genes on the genome of Corynebacterium glutamicum ATCC 
13032. 

[0012] Fig. 2 is electrophoresis showing the results of proteome analyses using proteins derived from (A) Coryne- 
20 bacterium glutamicum ATCC 13032, (B) FERM BP-7134, and (C) FERM BP-158. 

[0013] Rg. 3 is a flow chart of an example of a system using the computer readable media according to the present 
invention. 

[001 4] Fig. 4 is a flow chart of an example of a system using the computer readable media according to the present 
invention. 

25 

DETAILED DESCRIPTION OF THE INVENTION 

[0015] This application is based on Japanese applications No. Hei. 11-377484 filed on December 16, 1999, No. 
2000-159162 filed on April 7, 2000 and No. 2000-280988 filed on August 3, 2000, the entire contents of which are 

30 incorporated hereinto by reference. 

[0016] From the viewpoint that the detennination of the full nucleotide sequence of Corynebacterium glutamicum 
would make it possible to specify gene regions which had not been previously identified, to detemnine the function of 
an unknown gene derived from the microorganism through comparison with nucleotide sequences of known genes 
and amino acid sequences of known genes, and to obtain a useful mutant based on the presumption of the metabolic 

35 regulatory mechanism of a useful product by the microorganism, the inventors conducted intensive studies and, as a 
result, found that the complete genome sequence of Corynebacterium glutamicum can be detennined by applying the 
whole genome shotgun method. 

[0017] Specifically, the present invention relates to the following (1) to (65): 
10 (1 ) A method for at least one of the following: 

(A) identifying a mutation point of a gene derived from a mutant of a corynefomi bacterium, 

(B) measuring an expression amount of a gene derived from a corynefomi bacterium, 

(C) analyzing an expression profile of a gene derived from a coryneform bacterium, 
^ (D) analyzing expression pattems of genes derived from a coryneform bacterium, or 

(E) identifying a gene homologous to a gene derived from a corynefonn bacterium, 
said method comprising: 

(a) producing a polynucleotide array by adhering to a solid support at least two polynucleotides selected 
so from the group consisting of first polynucleotides comprising the nucleotide sequence represented by any 

one of SEQ ID N0S:1 to 3501 , second polynucleotides which hybridize with the first polynucleotides under 
stringent conditions, and third polynucleotides comprising a sequence of 10 to.200 continuous bases of 
the first or second polynucleotides, 

(b) incubating the polynucleotide array with at least one of a labeled polynucleotide derived from a co- 
55 ryneform bacterium, a labeled polynucleotide derived from a mutant of the coryneform bacterium or a 

labeled polynucleotide to be examined, under hybridization conditions, 

(c) detecting any hybridization, and 

(d) analyzing the result of the hybridization. 
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As used herein, for example, the at least two polynucleotid s can b at least two of the first polynu- 
cleotides, at least two of the second polynucleotides, at least two of the third polynucleotides, or at least 
two of the first, second and third polynucleotides. 

5 (2) The method according to (1), wherein the corynefonm bacterium is a microorganism belonging to the genus 

Corynebacterium, the genus Brevibacterium, or the genus Microbacterium, 

(3) The method according to (2). wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetogtutamicum, Corynebacterium callunae, Corynebacterium hercuiis, Corynebacterium iiiium, Corynebacteri- 

10 um melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes, 

(4) The method according to (1), wherein the polynucleotide derived from a corynefonm bacterium, the polynuce- 
lotide derived from a mutant of the corynefonm bacterium or the polynucleotide to be examined is a gene relating 
to the biosynthesis of at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, 
an organic acid, and analogues thereof. 

15 (5) The method according to (1), wherein the polynucleotide to be examined is derived from Escherichia coli, 

(6) A polynucleotide array, comprising: 



at least two polynucleotides selected from the group consisting of first polynucleotides comprising the nucle- 
otide sequence represented by any one of SEQ ID NOS:1 to 3501, second polynucleotides which hybridize 
20 with the first polynucleotides under stringent conditions, and third polynucleotides comprising 10 to 200 con- 

tinuous bases of the first or second polynucleotides, and 
a solid support adhered thereto. 



As used herein, for example, the at least two polynucleotides can be at least two of the first polynucleotides, 
25 at least two of the second polynucleotides, at least two of the third polynucleotides, or at least two of the first, 
second and third polynucleotides. 

(7) A polynucleotide comprising the nucleotide sequence represented by SEQ ID NO: 1 or a polynucleotide having 
a homology of at least 80% with the polynucleotide. 

(8) A polynucleotide comprising any one of the nucleotide sequences represented by SEQ ID N0S:2 to 3431 , or 
30 a polynucleotide which hybridizes with the polynucleotide under stringent conditions. 

(9) A polynucleotide encoding a polypeptide having any one of the amino acid sequences represented by SEQ ID 
NOS:3502 to 6931, or a polynucleotide which hybridizes therewith under stringent conditions. 

(10) A polynucleotide which is present in the 5' upstream or 3' downstream of a polynucleotide comprising the 
nucleotide sequence of any one of SEQ ID N0S:2 to 3431 in a whole polynucleotide comprising the nucleotide 

35 sequence represented by SEQ ID NO:1 , and has an activity of regulating an expression of the polynucleotide. 

(11) A polynucleotide comprising 10 to 200 continuous bases in the nucleotide sequence of the polynucleotide of 
any one of (7) to (1 0), or a polynucleotide comprising a nucleotide sequence complementary to the polynucleotid 
comprising 1 0 to 200 continuous based. 

(12) A recombinant DNA comprising the polynucleotide of any one of (8) to (11). 

^ (13) A transfonmant comprising the polynucleotide of any one of (8) to (11) or the recombinant DNA of (12). 

(1 4) A method for producing a polypeptide, comprising: 

culturing the transfomnant of (13) in a medium to produce and accumulate a polypeptide encoded by the 
polynucleotide of (8) or (9) in the medium, and 
45 recovering the polypeptide from the medium. 

(1 5) A method for producing at least one of an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, 
and analogues thereof, comprising: 

so culturing the transfomnant of (13) in a medium to produce and accumulate at least one of an amino acid, a 

nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof in the medium, and 
recovering the at least one of the amino acid, the nucleic acid, the vitamin, the saccharide, the organic acid, 
and analogues thereof from the medium. 

55 (1 6) A polypeptide encoded by a polynucleotide comprising th nucleotid sequence selected from SEQ ID NOS: 

2 to 3431. 

(17) A polypeptide comprising the amino acid sequence selected from SEQ ID NOS:3502 to 6931. 

(18) The polypeptid according to (16) or (17), wherein at least on amino acid is deleted, replaced, inserted or 
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added, said polypeptid shaving an activity which is substantially th same as that of th polypeptide without said 
at least one amino acid deletion, replacement, insertion or addition. 

(19) A polypeptide comprising an amino acid sequence having a homology of at least 60% with the amino acid 
sequence of the polypeptide of (16) or (17), and having an activity which is substantially the sam as that of the 

5 polypeptide. 

(20) An antibody which recognizes the polypeptide of any one of (16) to (19). 

(21 ) A polypeptide array, comprising: 

at least one pol^eptide or partial fragment polypeptide selected from the polypeptides of (16) to (19) and 
10 partial fragment polypeptides of the polypeptides, and 

a solid support adhered thereto. 

(22) A polypeptide array, comprising: 

15 at least one antibody which recognizes a polypeptide or partial fragment polypeptide selected from the polypep- 

tides of (16) to (19) and partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

(23) A system based on a computer for identifying a target sequence or a target structure motif derived from a 
20 corynefonm bacterium, comprising the following: 

(i) a user input device that inputs at least one nucleotide sequence Infomiation selected from SEQ ID N0S:1 
to 3501, and target sequence or target stmcture motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

25 (iii) a comparator that compares the at least one nucleotide sequence infomnation selected from SEQ ID NOS: 

1 to 3501 with the target sequence or target structure motif information, recorded by the data storage device 
for screening and analyzing nucleotide sequence infomriation which is coincident with or analogous to th 
target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

30 

(24) A method based on a computer for identifying a target sequence or a target structure motif derived from a 
coryneform bacterium, comprising the following: 

(i) inputting at least one nucleotide sequence infomriation selected from SEQ ID N0S:1 to 3501, target se- 
35 quence information or target structure motif infomriation into a user input device; 

(ii) at least temporarily storing said infomriation; 

(iii) comparing the at least one nucleotide sequence infomiation selected from SEQ ID NOS:1 to 3501 with 
the target sequence or target structure motif infomriation; and 

(iv) screening and analyzing nucleotide sequence information which is coincident with or analogous to the 
<0 target sequence or target structure motif infonmation. 

(25) A system based on a computer for identifying a target sequence or a target structure motif derived from a 
corynefomri bacterium, comprising the following: 

^ (i) a user input device that inputs at least one amino acid sequence infomiation selected from SEQ ID NOS: 

3502 to 7001, and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input infonmation; 

(iii) a comparator that compares the at least one amino acid sequence infonmation selected from SEQ ID NOS: 
3502 to 7001 with the target sequence or target structure motif infomiation, recorded by the data storage 

50 device for screening and analyzing amino acid sequence infomiation which is coincident with or analogous to 

the target sequence or target structure motif infomiation; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

(26) A method based on a computer for identifying a target sequence or a target structure motif derived from a 
55 coryneform bact rium, comprising the following: 

(i) inputting at least on amino acid sequenc infonmation select dfrom SEQ ID NOS:3502 to 7001, and target 
sequence infonmation or target structure motif information into a user input device; 
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(ii) at least temporarily storing said information; 

(iii) comparing the at least on amino acid sequenc information selected from SEQ ID NOS:3502 to 7001 
with the target sequence or target structure motif information; and 

(iv) screening and analyzing amino acid sequence information which Is coincident with or analogous to the 
target sequence or target structure motif information. 

(27) A system based on a computer for determining a function of a polypeptide encoded by a polynucleotide having 
a target nucleotide sequence derived from a coryneform bacterium, comprising the following: 

(i) a user input device that inputs at least one nucleotide sequence infonDatlon selected from SEQ ID N0S:2 
to 3501 , function infomnation of a polypeptide encoded by the nucleotide sequence, and target nucleotid 
sequence information; 

(ii) a data storage device for at least temporarily storing the input infomnation; 

(iii) a comparator that compares the at least one nucleotide sequence Infonmation selected from SEQ ID NOS: 
2 to 3501 with the target nucleotide sequence information, and determining a function of a polypeptide encoded 
by a polynucleotide having the target nucleotide sequence which is coincident with or analogous to the poly- 
nucleotide having at least one nucleotide sequence selected from SEQ ID N0S:2 to 3501; and 

(iv) an output devices that shows a function obtained by the comparator. 

(28) A method based on a computer for detemnining a function of a polypeptide encoded by a polypeptide encoded 
by a polynucleotide having a target nucleotide sequence derived from a coryneform bacterium, comprising the 
following: 

(1) inputting at least one nucleotide sequence infomnation selected from SEQ ID NOS:2 to 3501, function in- 
fomnation of a polypeptide encoded by the nucleotide sequence, and target nucleotide sequence information; 

(ii) at least temporarily storing said infomnation; 

(iii) comparing the at least one nucleotide sequence infonnation selected from SEQ ID N0S:2 to 3501 with 
the target nucleotide sequence infonnation; and 

(iv) detennlning a function of a polypeptide encoded by a polynucleotide having the target nucleotide sequence 
which is coincident with or analogous to the polynucleotide having at least one nucleotide sequence selected 
from SEQ ID N0S:2 to 3501 . 

(29) A system based on a computer for detemnining a function of a polypeptide having a target amino acid sequenc 
derived from a corynefomn bacterium, comprising the following: 

(i) a user input device that Inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , function information based on the amino acid sequence, and target amino acid sequence infor- 
mation; 

(ii) a data storing device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one amino acid sequence infomnation selected from SEQ ID NOS: 
3502 to 7001 with the target amino acid sequence infomnation for detemnining a function of a polypeptide 
having the target amino acid sequence which is coincident with or analogous to the polypeptide having at least 

. one amino acid sequence selected from SEQ ID NOS:3502 to 7001 ; and 

(iv) an output device that shows a function obtained by the comparator. 

(30) A method based on a computer for determining a function of a polypeptide having a target amino add sequenc 
derived from a coryneform bacterium, comprising the following: 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001, function 
. infomnation based on the amino acid sequence, and target amino acid sequence infomnation; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence infomnation s iected from SEQ ID NGS:3502 to 7001 
with the target amino acid s quence infonmatiori; and 

(iy) determining a function of a polypeptid having the target amino acid sequence which is coincident with or 
analogous to the polypeptide having at least one amino acid sequence selected from SEQ ID NOS:3502 to 
7001. 

(31) The system according to any one of (23), (25), (27) and (29), wher in a corynefomn bacterium is a microor- 
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ganism of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium, 

(32) The m thod according to any one of (24), (26), (28) and (30). wherein a coryneform bacterium is a microor- 
ganism f th genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium, 

(33) The system according to (3 1 ), wherein the microorganism belonging to th g nus Corynebacterium is selected 
from th group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetoglutamicum, corynebacterium callunae, corynet)acterium herculis, Corynebacterium lilium, Corynebacterium 
melassecofa, Corynebacterium thermoaminogenes, emd Corynebacterium ammoniagenes. 

(34) The method according to (32), wherein the microorganism belonging to the genus Corynebacterium is selected 
. from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 

acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacteri- 
um melassecoia, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

(35) A recording medium or storage device which Is readable by a computer in which at least one nucleotide 
sequence information selected from SEQ ID N0S:1 to 3501 or function information based on the nucleotide se- 
quence is recorded, and is usable in the system of (23) or (27) or the method of (24) or (28). 

(36) A recording medium or storage device which is readable by a computer in which at least one amino acid 
sequence information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid 
sequence is recorded, and is usable in the system of (25) or (29) or the method of (26) or (30). 

(37) The recording medium or storage device according to 

(35) or (36), which is a computer readable recording medium selected from the group consisting of a floppy disc, 
a hard disc, a magnetic tape, a random access memory (RAM), a read only memory (ROM), a magneto-optic disc 
(MO). CD-ROM, CD-R, CD-RW, DVD-ROM. DVD-RAM and DVD-RW. 

(38) A polypeptide having a homoserlne dehydrogenase activity, comprising an amino acid sequence in which the 
Val residue at the 59th in the amino acid sequence of homoserine dehydrogenase derived from a corynefomn 
bacterium is replaced with an amino acid residue other than a Val residue. 

(39) A polypeptide comprising an amino acid sequence in which the Val residue at the 59th position in the amino 
acid sequence as represented by SEQ ID NO:6952 is replaced with an amino acid residue other than a Val residue. 

(40) The polypeptide according to (38) or (39), wherein the Val residue at the 59th position is replaced with an Ala 
residue. 

(41) A polypeptide having pyruvate carboxylase activity, -comprising an amino acid sequence in which the Pro 
residue at the 458th position in the amino acid sequence of pyruvate carboxylase derived from a coryneform 
bacterium is replaced with an amino acid residue other than a Pro residue. 

(42) A polypeptide comprising an amino acid sequence in which the Pro residue at the 458th position in the amino 
acid sequence represented by SEQ ID NO:4265 is replaced with an amino acid residue other than a Pro residue. 

(43) The polypeptide according to (41) or (42). wherein the Pro residue at the 458th position is replaced with a Ser 
residue. 

(44) The polypeptide according to any one of (38) to (43), which is derived from Corynebacterium glutamicum. 

(45) A DNA encoding the polypeptide of any one of (38) to (44). 

(46) A recombinant DNA comprising the DNA of (45). 

(47) A transformant comprising the recombinant DNA of (46). 

(48) A transformant comprising in its chromosome the DNA of (45). 

(49) The transformant according to (47) or (48), which is derived from a corynefomri bacterium. ' 

(50) The transformant according to (49), which is derived from Corynebacterium glutamicum, 

(51 ) A method for producing L-lysine, comprising: 

cuituring the transformant of any one of (47) to (50) in a medium to produce and accumulate L-lysine in th 
medium, and 

recovering the L-lyslne from the culture. 

(52) A method for breeding a corynefomn bacterium using the nucleotide sequence information represented by 
SEQ ID N0S:1 to 3431, comprising the following: 

(i) comparing a nucleotide sequence of a genom or gene of a production strain deriv d a corynefonn bacte- 
rium which has been subjected to mutation breeding so as to produc at least one compound select d from 
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a f mientation 
method, with a corresponding nucleotide sequence in SEQ ID N0S:1 to 3431; 

(ii) identifying a mutation point present in the production strain based on a result obtained by (i); 

(iii) introducing the mutation point into a coryneform bacterium which is free of the mutation point; and 

(iv) examining productivity by th femientation method of the compound selected in (i) of the corynefomn 
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bacterium obtained in (iii). 

(53) Th method according to (52), wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
a signal transmission pathway. 

(54) The method according to (52), wher in th mutation point is a mutation point relating to a useful mutation 
which improves or stabilizes the productivity. 

(55) A method for breading a coryneform bacterium using the nucleotide sequence information represented by 
SEQ ID N0S:1 to 3431, comprising: 

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a corynefonm bact - 
rium which has been subjected to mutation breeding so as to produce at least one compound selected from 
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 
method, with ia conresponding nucleotide sequence in SEQ ID N0S:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtain by (i); 

(iii) deleting a mutation point from a coryneform bacterium having the mutation point; and 

(iv) examining productivity by the femnentation method of the compound selected in (i) of the corynefomn 
bacterium obtained in (iii). 

(56) The method according to (55). wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
a signal transmission pathway. 

(57) The method according to (55), wherein the mutation point is a mutation point which decreases or destabilizes 
the productivity. 

(58) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by 
SEQ ID N0S:2 to 3431. comprising the following: 

(i) identifying an isozyme relating to biosynthesis of at least one compound selected from em amino acid, a 
nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof, based on the nucleotide se- 
quence information represented by SEQ ID N0S:2 to 3431 ; 

(ii) classifying the isozyme identified In (i) into an isozyme having the same activity; 

(iii) mutating all genes encoding the isozyme having the same activity simultaneously; and 

(iv) examining productivity by a fermentation method of the compound selected in (i) of the corynefonm bac- 
terium which have been transformed with the gene obtained in (iii), 

(59) A method for breeding a corynefonm bacterium using the nucleotide sequence information represented by 
SEQ ID N0S:2 to 3431. comprising the following: 

(i) an'anging a function infonmation of an open reading frame (ORF) represented by SEQ ID N0S:2 to 3431; 

(ii) allowing the arranged ORF to conrespond to an enzyme on a known biosynthesis or signal transmission 
pathway; . 

(iii) explicating an unknown biosynthesis pathway or signal transmission pathway of a coryneform bacterium 
in combination with infonmation relating known biosynthesis pathway or signal transmission' pathway of a co- 
rynefonm bacterium; 

(iv) comparing the pathway explicated in (iii) with a biosynthesis pathway of a target useful product; and 

(v) transgenetically varying a corynefonm bacterium based on the nucleotide sequence infonmation to either 
strengthen a pathway which is judged to be important in the biosynthesis of the target useful product in (iv) or 
weaken a pathway which is judged not to be important in the biosynthesis of the target useful product in (iv). 

(60) A corynefonm bacterium, bred by the method of any one of (52) to (59). 

(61) The corynefonm bacterium according to (60), which is a microorganism belonging to the genus Corynebac- 
terium, the genus Brevibacterium, or the genus Microbacterium, 

(62) The coryneform bacterium according to (61 ), wh rein the microorganism belonging to th genus Corynebao 
terium is selected from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, 
Corynebacterium acetoglutamicum, Corynebacterium callunae, Corynebacterium hercuUs, Corynebacterium iii' 
ium, Corynebacterium meiassecoia, Corynebacterium ttiermoaminogenes, and Corynebacterium ammoniagenes. 

(63) A method for producing at least one compound selected from an amino acid, a nucleic acid, a vitamin, a 
saccharide, an organic acid and an analogu thereof, comprising: 

cuituring a coryneform bacterium of any one of (60) to (62) in a medium to produce and accumulate at least 
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one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and 
analogues thereof; 

recovering the compound from the culture. 

(64) Th method according to (63), wherein the compound is L-lysine. 

(65) A method for identifying a protein relating to useful mutation based on proteome analysis, comprising th 
following: 

(i) preparing 

a protein derived from a bacterium of a production strain of a corynefonm bacterium which has been sub- 
jected to mutation breeding by a fermentation process so as to produce at least one compound selected 
from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, and 
a protein derived from a bacterium of a parent strain of the production strain; 

(ii) separating the proteins prepared in (i) by two dimensional electrophoresis; 

(Hi) detecting the separated proteins, and comparing an expression amount of the protein derived from the 
production strain with that derived from the parent strain; 

(iv) treating the protein showing different expression amounts as a result of the comparison with a peptides 
to extract peptide fragments; 

(v) analyzing amino acid sequences of the peptide fragments obtained in (iv); and 

(vi) comparing the amino acid sequences obtained in (v) with the amino acid sequence represented by SEQ 
ID NOS:3502 to 7001 to identifying the protein having the amino acid sequences. 

As used herein, the tenm "proteome", which is a coined word by combining "protein" with "genome", refers to 
a method for examining of a gene at the polypeptide level. 

(66) The method according to (65), wherein the corynefomi bacterium is a microorganism belonging to the gequs 
Corynebactehum, the genus Brevibacterium, or the genus Microbacterium. 

(67) The method according to (66), wherein the microorganism belonging to the genus Corynebactenum is selected 
from the group consisting of Corynebactenum glutamicum, Corynebactenum acetoacidophilum, Corynebacterium 
acetoglutamicum, Corynebacterium callunae, corynebacterium tierculis, Corynebacterium lilium Corynebacterium 
meiassecola, Corynebacterium tliermoaminogenes, and Corynebacterium ammoniagenes. 

(68) A biologically pure culture of Corynebacterium glutamicum AHP-3 (PERM BP-7382). 

[001 8] The present invention will be described below in more detail, based on the determination of the full nucleotid 
sequence of corynefonm bacteria. 

1 . Determination of full nucleotide sequence of corynefomi bacteria 

[0019] The term "corynefonm bacteria" as used herein means a microorganism belonging to the genus Corynebac- 
terium, the genus Brevibacterium or the genus Microbacterium as defined in Bergeys Manual of Determinative Bacte- 
no/ofly, ft 599 (1974). 

[0020] Examples include Corynebacterium acetoacidophiium, Corynebacterium acetoglutamicum, Corynebacterium 
cailunae, Corynebacterium glutamicum, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium melas- 
secola, Corynebacterium thermoaminogenes, Brevibacterium saccharolyticum, Brevibacterium immariophilum, Brevi- 
bacterium roseum, Brevibacterium thiogenitalis, Microbacterium ammoniaphilum, and the like. 
[0021 ] Specific examples include Corynebacterium acetoacidophilum ATCC 1 3870, Corynebacterium acetoglutami- 
cum ATCC 15806, Corynebacterium callunae JKTCC 15991, Corynebacterium glutamicum ATCC 13032, Corynebac- 
terium glutamicum ATCC 13060, Corynebacterium glutamicum ATCC 13826 (prior genus and species: Brevibacterium 
flavum, or Corynebacterium lactofermentum), Corynebacterium glutamicum ATCC 14020 (prior genus and species: 
Brevibacterium divaricatum), Corynebacterium glutamicum ATCC 13869 (prior genus and species: Brevibacterium 
lactofermentum), Corynebacterium hercuiis ATCC 13868, Corynebacterium lilium ATCC 15990, Corynebacterium 
melassecolaPJCC 17965, Corynebacterium thennoamlnogenesFERM 9244, Brevibacterium saccharolyticum P^CC 
14066, Brevibacterium immarioptiilumAJCC 14068, Brevibacterium roseum PJCC 13825, Brevibacterium thiogenitalis 
ATCC 19240, Microbacterium ammoniaphilum fiJCC 15354. and the like. 
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(1) Preparation of genome DNA of coryneform bacteria 

[0022] Coryneform bacteria can be cultured by a conventional method. 

[0023] Any of a . natural medium and a synthetic medium can be used, so long as it is a medium suitable for fficlent 
5 culturing of th microorganism, and it contains a carbon source, a nitrog n source, an inorganic salt, and the lik which 
can be assimilated by the microorganism. 

[0024] In Corynebacterium glutamicum, for example, a BY medium (7 g/l meat extract, 10 g/l peptone, 3 g/l sodium 
chloride, 5 g/l yeast extract, pH 7.2) containing 1% of glycine and the like can be used. The culturing is earned out at 
25 to 35''C ovemlght. 

10 [0025] After the completion of the culture, the cells are recovered from the culture by centrifugation. The resulting 
cells are washed with a washing solution. 

[0026] Examples of the washing solution include STE buffer (1 0.3% sucrose, 25 mmol/1 Trts hydrochloride, 25 mmol/ 
I ethylenediaminetetraacetic acid (hereinafter referred to as "EDTA"), pH 8.0), and the like. 
[0027] Genome DNA can be obtained from the washed cells according to a conventional method for obtaining ge- 
15 nome DNA, namely, lysing the cell wall of the cells using a lysozyme and a surfactant (SDS, etc), eliminating proteins 
and the like using a phenol solution and a phenol/chloroform solution, and then precipitating the genome DNA with 
ethanol or the like. Specifically, the following method can be illustrated. 

[0028] The washed cells are suspended In a washing solution containing 5 to 20 mg/l lysozyme. After shaking, 5 to 
20% SDS is added to lyse the cells. In usual, shaking Is gently perfomned at 25 to 40^0 for 30 minutes to 2 hours. After 
20 shaking, the suspension is maintained at 60 to 70°C for 5 to 1 5 minutes for the lysis. 

[0029] After the lysis, the suspension is cooled to ordinary temperature, and 5 to 20 ml of Tris-neutralized phenol is 
added thereto, followed by gently shaking at room temperature for 15 to 45 minutes. 

[0030] After shaking, centrifugation (15,000 x g, 20 minutes, 20^*0) is earned out to fractionate the aqueous layer. 
[0031] After performing extraction with phenol/chloroform and extraction with chloroform (twice) in the same manner, 

25 3 mol/l sodium acetate solution (pH 5.2) and Isopropanol are added to the aqueous layer at 1/10 times volume and 2 
times volume, of the aqueous layer, respectively, followed by gently stining to precipitate the genome DNA. 
[0032] The genome DNA is dissolved again in a buffer containing 0.01 to 0.04 mg/ml RNase. As an example of th 
buffer, TE buffer (10 mmol/l Tris hydrochloride, 1 mol/l EDTA, pH 8.0) can be used. After dissolving, the resultant 
solution is maintained at 25 to 40^C for 20 to 50 minutes and then extracted successively with phenol, phenol/chlorofomn 

30 and chloroform as in the above case. 

[0033] After the extraction, isopropanol precipitation is canied out and the resulting DNA precipitate is washed with 
70% ethanol, followed by air drying, and then dissolved in TE buffer to obtain a genome DNA solution. 

(2) Production of shotgun library 

35 

[0034] A method for produce a genome DNA library using the genome DNA of the corynefonn bacteria prepared in 
the above (1 ) include a method described in Molecular Cloning, A laboratory Manual, Second Edition (1 989) (hereinafter 
referred to as "Molecular Cloning, 2nd ed."). In particular, the following method can be exemplified to prepare a genom 
DNA library appropriately usable in detemnining the full nucleotide sequence by the shotgun method. 
40 [0035] To 0.01 mg of the genome DNA of the corynefomn bacteria prepared in the above (1) , a buffer, such as TE 
buffer or the like, is added to g'tve a total volume of 0.4 ml. Then, the genome DNA is digested into fragments of 1 to 
10 kb with a sonicator (Yamato Powersonic Model 50). The treatment with the sonicator is performed at an output of 
20 continuously for 5 seconds. 

[0036] The resulting genome DNA fragments are blunt-ended using DNA blunting kit (manufactured by Takara Shuzo) 
45 or the like. 

[0037] The blunt-ended genome, fragments are fractionated by agarose gel or polyacrylamide gel electrophoresis 
and genome fragments of 1 to 2 kb are cut out from the gel. 

[0038] To the gel, 0.2 to 0.5 ml of a buffer for eluting DNA, such as MG elution buffer (0.5 mol/l ammonium acetate, 
10 mmol/1 magnesium acetate, 1 mmol/l EDTA, 0.1% SDS) or the like, is added, followed by shaking at 25 to 40*'C 
50 overnight to elute DNA. 

[0039J The resulting DNA eluate is treated with phenol/chloroform and th n precipitated with ethanol to obtain a 
genome library insert. 

[0040] This insert is ligated into a suitable vector, such as pUCI 8 S/nal/SAP (manufactured by Amersham Pharmacia 
Biotech) or the lik . using T4 ligas (manufactured by Takara Shuzo) or the like. Th ligation can be carried out by 
55 allowing a mixture to stand at 1 0 to 20^C for 20 to 50 hours. 

[0041] The resulting ligation product is precipitated with ethanol and dissolved in 5 to 20 ^.1 of TE buffer. 

[0042] Escherichia coli is transfonned in accordanc with a conventional method using 0.5 to 2 ^1 of the ligation 

solution. Examples of the transformation method include the electroporatton method using ELECTRO MAX DHIOB 
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(manufactured by Lif Technologies) for Escherichia coll The electroporation method can be canied out under the 
conditions as described in th manufacturer's instructions. 

[0043] The transformed Escherichia coli'is spread on a suitabi s lection medium containing agar, for example, LB 
plate medium containing 10 to 100 mg/l ampicillin (LB medium (10 g/l bactotrypton, 5 g/l yeast extract, 10 g/l sodium 
5 chloride, pH 7.0) containing 1 .6% of agar) when pUCI 8 Is used as the cloning vector, and cultured therein. 

[0044] The transfonnant can be obtained as colonies fomned on the plate medium. In this step, it is possible to select 
the transfonnant having the recombinant DNA containing the genome DNA as white colonies by adding X-gal and 
IPTG (tsopropyl-p-thiogalactopyranoside) to the plate medium. 

[0045] The transfonnant is allowed to stand for culturing in a 96-well titer plate to which 0.05 ml of the LB medium 
10 containing 0.1 mg/ml of ampicillin has been added in each well. The resulting culture can be used in an experiment of 
(4) described below. Also, the culture solution can be stored at -SO^C by adding 0.05 ml per well of the LB medium 
containing 20% glycerol to the culture solution, followed by mixing, and the stored culture solution can be used at any 
time. 

15 (3) Production of cosmid library 

[0046] The genome DNA (0.1 mg) of the coryneform bacteria prepared in the above (1) is partially digested with a 
restriction enzyme, such as Sau3AI or the like, and then ultracentrifuged (26,000 rpm, 18 hours, 20''C) under a 10 to 
40% sucrose density gradient using a 10% sucrose buffer (1 mol/l NacI, 20 mmol/l Tris hydrochloride, 5 mmol/I EDTA, 
20 1 0% sucrose, pH 8.0) and a 40% sucrose buffer (elevating the concentration of the 1 0% sucrose buffer to 40%). 

[0047] After the centrifugation, the thus separated solution Is fractionated into tubes in 1 ml per each tube. After 
confinning the DNA fragment size of each fraction by agarose gel electrophoresis, a fraction rich in DNA fragments of 
about 40 Icb Is precipitated with ethanol. 

[0048] The resulting DNA fragment is ligated to a cosmid vector having a cohesive end which can be ligated to th 
25 fragment. When the genome DNA is partially digested with Sau3AI, the partially digested product can be ligated to, 
for example, the BamHl site of superCosI (manufactured by Stratagene) in accordance with the manufacture's instruc- 
tions. 

[0049] The resulting ligation product is packaged using a packaging extract which can be prepared by a method 
described in Molecular Cloning, 2nd ed. and then used in transforming Escherichia coil More specifically, the ligation 
30 product is packaged using, for example, a commercially available packaging extract, Gigapack III Gold Packaging 
Extract (manufactured by Stratagene) in accordance with the manufacture's instmctions and then introduced into £s- 
cherichia co// XL-1-BlueMR (manufactured by Stratagene) or the like. 

[0050] The thus transfomned Escherichia coii is spread on an LB plate medium containing ampicillin, and cultured 
therein. 

35 [0051] The transformant can be obtained as colonies fonned on the plate medium. 

[0052] The transformant is subjected to standing culture in a 96-well titer plate to which 0.05 ml of the LB medium 
containing 0.1 mg/ml ampicillin has been added. 

[0053] The resulting culture can be employed in an experiment of (4) described below. Also, the culture solution can 
be stored at -dO^'C by adding 0.05 ml per well of the LB medium containing 20% glycerol to the culture solution, followed 
40 by mixing, and the stored culture solution can be used at any time. 

(4) Determination of nucleotide sequence 

(4-1) Preparation of template 

45 

[0054] The full nucleotide sequence of genome DNA of coryneform bacteria can be determined basically according 
to the whole genome shotgun method (Science, 269: 496-512 (1995)). 

[0055] The template used in the whole genome shotgun method can be prepared by PGR using the library prepared 
In the above (2) {DNA Research, 5: . 1 -9 (1 998)). 
50 [0056] Specificaliy, the template can be prepared as follows. 

[0057] The clone derived from the whole genome shotgun library is inoculated by using a replicator (manufactured 
by GENETIX) into each well of a 96-well plate to which 0.08 ml per well of the LB medium containing 0. 1 mg/ml ampicillin 
has been added, followed by stationarily culturing at 3TC ovemight. 

[0058] Next, th culture solution is transport d, using a copy plat (manufactured by Tokken), into each well of a 
55 96-well reaction plate (manufactured by PE Biosystems) to which 0.025 ml per well of a PGR reaction solution has 
been added using TaKaRa Ex Taq (manufactured by Takara Shuzo). Then, PGR is canied out in accordance with th 
protocol by Makino et a/. (DNA Research, 5: 1-9 (1998)) using GeneAmp PGR Syst m 9700 (manufactured by PE 
Biosystems) to amplify th inserted fragments. 
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[0059] The excessive primers and nucleotid s ar eliminated using a kit for purifying a PGR product, and the product 
is used as the template in th sequencing reaction. 

[0060] It Is also possible tod temriine the nucleoticle sequence using a double-stranded DNAplasmid as at mplat . 

[0061] The double-stranded DNA plasmld used as the template can be obtained by the following method. 
5 [0062] The clone deriv d from the whole genome shotgun library is inoculated into each well of a 24- or 96-w II plate 

to which 1.5 ml per well of a 2 x YT medium (16 g/l bactotrypton, 10 g/l yeast extract, 5 g/l sodium chloride, pH 7.0) 

containing 0.05 mg/ml ampicillin has been added, followed by culturing under shaking at 3TC ovemight. 

[0063] The double-stranded DNA plasmid can be prepared from the culture solution using an automatic plasmld 

preparing machine KURABO PI-50 (manufactured by Kurabo Industries), a multiscreen (manufactured by Millipor ) 
10 or the like, according to each protocol. 

[0064] To purify the plasmid, Blomek 2000 manufactured by Beckman Coulter and the like can be used. 

[0065] The resulting purified double-stranded DNA plasmid is dissolved in water to give a concentration of about 0. 1 

mg/ml. Then, it can be used as the template in sequencing. 

IS (4-2) Sequencing reaction 

[0066] The sequencing reaction can be carried out according to a commercially available sequence kit or the like. A 
specific method is exemplified below. 

[0067] To 6 (il of a solution of ABI PRISM BIgDye Terminator Cycle Sequencing Ready Reaction Kit (manufactured 
^ by PE Biosystems), 1 to 2 pmol of an M13 regular direction primer (M13-21) or an MIS reverse direction primer 
(MI3REV) (DNA Research, 5: 1-9 (1998)) and 50 to 200 ng of the template prepared in the above (4-1) (the PCR 
product or plasmld) to give 10 of a sequencing reaction solution. 

[0068] A dye temninator sequencing reaction (35 to 55 cycles) is carried out using this reaction solution and GeneAmp 
PCR System 9700 (manufactured by PE Biosystems) or the like. The cycle parameter can be determined In accordanc 
25 wHh a commercially available kit, for example, the manufacture's instructions attached with ABI PRISM Big Dye Ter- 
minator Cycle Sequencing Ready Reaction Kit. 

[0069] The sample can be purified using a commercially available product, such as Multi Screen HV plate (manu- 
factured by Millipore) or the like, according to the manufacture's instructions. 

[0070] The thus purified reaction product is precipitated with ethanol, dried and then used for the analysis. The dried 
30 reaction product can be stored in the dark at -30°C and the stored reaction product can be used at any time. 

[0071] The dried reaction product can be analyzed using a commercially available sequencer and an analyzer ac- 
cording to the manufacture's instructions. 

[0072] Examples of the commercially available sequencer include ABI PRISM 377 DNA Sequencer (manufactured 
by PE Biosystems). Example of the analyzer include ABI PRISM 3700 DNA Analyzer (manufactured by PE Biosystems). 

35 

(5) Assembly 

[0073] A software, such as phred (The University of Washington) or the like, can be used as base call for use in 
analyzing the sequence infomnation obtained in the above (4). A software, such as Cross.Match (The University of 
"fo Washington) or SPS Cross.Match (manufactured by Southwest Parallel Software) or tiie like, can be used to mask 
the vector sequence information. 

[0074] For the assembly, a software, such as phrap (The University of Washington), SPS phrap (manufactured by 
Southwest Parallel Software) or the like, can be used. 

[0075] In the above, analysis and output of the results thereof, a computer such as UNIX, PC, Macintosh, and the 
^ like can be used. 

[0076] Contig obtained by the assembly can be analyzed using a graphical editor such as consed (The University 
of Washington) or the like. 

[0077] It is also possible to perform a series of the operations from the base call to the assembly in a lump using a 
script phredPhrap attached to the consed. 
so [0078] As used herein, software will be understood to also be referred to as a comparator. 

(6) Determination of nucleotide sequence in gap part 

[0079] Each of the cosmids in the cosmid library constaicted in the above (3) is prepared In the sam manner as in 
S5 the preparation of the double-stranded DNA plasmid described in the above (4-1 ). The nucleotid sequence at the end 
of th Insert fragment of the cosmid is detenmlned using a commercially availabi kit, such as ABI PRISM BigDye 
Tenminator Cycle Sequencing Ready R action Kit (manufactured by PE Biosystems) according to the manufactur 's 
instructions. 
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[0080] About 800 cosmtd clones are sequ need at both ends of the inserted fragment to d tect a nucleotide sequence 
in the contig derived from the shotgun sequencing obtained in (5) which is coincident with the sequence. Thus, the 
chain linkage between respective cosmid clones and r spective contigs are clarified, and mutual alignment is carried 
out. Furthennore, th results are compared with known physical maps to map the cosmids and the contigs. In cas of 
5 Corynebacterium glutamicum AJCC 13032, a physical map of MoL Gen. Genet, 252. 255-265 (1996) can b used. 
[0081] The sequence in the region which cannot be covered with the contigs (gap part) can be detemnlned by the 
following method. 

[0082] Clones containing sequences positioned at the ends of the contigs are selected. Among these, a clone wherein 
only one end of the inserted fragment hasf been determined Is selected and the sequence at the opposite end of th 
^0 inserted fragment is detemnined. 

[0083] A shotgun library clone or a cosmid clone derived therefrom containing the sequences at the respective ends 
of the inserted fragments in the two contigs is identified and the full nucleotide sequence (5t the inserted fragment of 
the clone is detemnined. 

[0084] According to this method, the nucleotide sequence of the gap part can be detennined. 
15 [0085] ' When no shotgun library clone or cosmid clone covering the gap part is available, primers complementary to 

the end sequences of the two different contigs are prepared and the DNA fragment in the gap part is amplified. Then, 

sequencing is perfomned by the primer walking method using the amplified DNA fragment as a template or by the 

shotgun method in which the sequence of a shotgun clone prepared from the amplified DNA fragment is detemnined. 

Thus, the nucleotide sequence of the above-described region can be determined. 
20 [0086] In a region showing a low sequence accuracy, primers are synthesized using AUTOFINISH function and 

NAVIGATING function of consed (The University of Washington), and the sequence is detemnined by the primer walking 

method to improve the sequence accuracy. 

[0087] Examples of the thus detemnined nucleotide sequence of the full genome include the full nucleotide sequence 
of genome of Corynebacterium glutamicum ATCC 13032 represented by SEQ ID N0:1 . 

25 

(7) Detemnination of nucleotide sequence of microorganism genome DNA using the nucleotide sequence represented 
by SEQ ID NO:1 

[0088] A nucleotidesequence of a polynucleotide having a homology of 80% or more with the full nucleotide sequenc 
30 of Corynebacterium glutamicum ATCC 13032 represented by SEQ ID NO:1 as detemnined above can also be deter- 
mined using the nucleotide sequence represented by SEQ ID N0:1, and the polynucleotide having a nucleotide se- 
quence having a homology of 80% or more with the nucleotide sequence represented by SEQ ID N0:1 of the present 
invention is within the scope of the present invention. The term "polynucleotide having a nucleotide sequence having 
a homology of 80% or more with the nucleotide sequence represented by SEQ ID N0:1 of the present invention" is a 
35 polynucleotide in which a full nucleotide sequence of the chromosome DNA can be detemnined using as a primer an 
oligonucleotide composed of continuous 5 to 50 nucleotides in the nucleotide sequence represented by SEQ ID NO: 

1 , for example, according to PGR using the chromosome DNA as a template. A particulariy prefenred primer in det r- 
mination of the full nucleotide sequence is an oligonucleotide having nucleotide sequences which are positioned at 
the inten/al of about 300 to 500 bp, and among such oligonucleotides, an oligonucleotide having a nucleotide sequence 

40 selected from DNAs encoding a protein relating to a main metabolic pathway is particulariy preferred. The polynucle- 
otide in which the full nucleotide sequence of the chromosome DNA can be determined using the oligonucleotide 
includes polynucleotides constituting a chromosome DNA derived from a microorganism belonging to coryneform bac- 
teria. Such a polynucleotide is preferably a polynucleotide constituting chromosome DNA derived from a microorganism 
belonging to the genus Corynebacterium, more preferably a polynucleotide constituting a chromosome DNA of Co- 

45 rynebacterium glutamicum. 

2. Identification of ORF (open reading frame) and expression regulatory fragment and detemnination of the function of 
ORF 

so [0089] Based on the full nucleotide sequence data of the genome derived from coryneform bacteria detemnined in 
the at>ove item 1 , an ORF and an expression modulating fragm nt can b identified. Furthemnore, the function of the 
thus detemnined ORF can be determined. 

[0090] The ORF means a continuous region in the nucleotide s quence of mRNA which can b translated as an 
amino acid sequence to mature to a protein. A region of the DNA coding for the ORF of mRNA is also called ORF 
S5 [0091] The expression modulating fragm nt (hereinafter ref nred to as "EMF") Is used herein to define a series of 
polynucleotide fragments which modulat the expression of the ORF or another sequence ligated operatabty thereto. 
The xpression "modulate th expression of a sequenc ligated operatabl/* is used herein to ref r to changes in the 
expression of a sequence due to the presenc of the EMF. Examples of th EMF include a promoter, an operator, an 
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enhancer, a silencer, a ribosome-binding sequence, a transcriptional t rmination sequence, and the (ike. In corynef onm 
bacteria, an EMF is usually present in an interg nic segment (a fragment positioned between two genes; about IQto 
200 nucleotides in length). Accordingly, an EMF is frequently pres nt in an intergenic segment of 10 nucleotides or 
longer. It Is also possibi to determine ordiscoverthe presence of an EMF by using known EMF sequ ncesasatarg t 
5 s quence or a target structural motif (or a target motif) using an appropriate software or comparator, such as FASTA 
(Proc. Nati. Acad, ScL USA, 85: 2444-48 (1988)), BLAST (J. MoL BioL, 275:403-410 (1990)) or the like. Also, it can 
be identified and evaluated using a known EMF-capturing vector (for example, pKK232-8; manufactured by Amersham 
Pharmacia Biotech). 

. [0092] The tenm "target sequence" is used herein to refer to a nucleotide sequence composed of 6 or more nucle- 
ic otides, an amino acid sequence composed of 2 or more amino acids, or a nucleotide sequence encoding this amino 
acid sequence composed of 2 or more amino acids. A longer target sequence appears at random in a data base at 
the lower possibility. The target sequence is preferably about 10 to 100 amino acid residues or about 30 to 300 nude* 
otide residues. 

[0093] The tenm "target structural motif" or "target motif is used herein to refer to a sequence or a combination of 
15 sequences selected optionally and reasonably. Such a motif is selected on the basis of the threedimensional structur 
fonmed by the folding of a polypeptide by means known to one of ordinary skill in the art. Various motives are known. 
[0094] Examples of the target motif of a polypeptide include, but are not limited to, an enzyme activity site, a protein- 
protein interaction site, a signal sequence, and the like. Examples of the target motif of a nucleic acid include a promoter 
sequence, a transcriptional regulatory factor binding sequence, a hair pin structure, and the like. 
20 [0095] Examples of highly useful EMF include a high-expression promoter, an inducible-expression promoter, and 
the like. Such an EMF can be obtained by positionally determining the nucleotide sequence of a gene which is known 
or expected as achieving high expression (for example, ribosomal RNA gene: GenBank Accession No. M16175 or 
Z46753) or a gene showing a desired induction pattern (for example, isocitrate lyase gene induced by acetic acid: 
Japanese Published Unexamined Patent Application No. 56782/93) via the alignment with the full genome nucleotid 
25 sequence determined in tiie above item 1 , and isolating the genome fragment in the upstream part (usually 200 to 500 
nucleotides from the translation initiation site). It is also possible to obtain a highly useful EMF by selecting an EMF 
showing a high expression efficiency or a desired induction pattern from among promoters captured by the EMF- 
capturing vector as described above. 

[0096] The ORF can be identified by extracting characteristics common to individual ORFs, constructing a general 
30 model based on these characteristics, and measuring the conformity of the subject sequence with the model. In th 
identification, a software, such as GeneMark (Nuc, Acids, Res., 22. 4756-67 (1994): manufactured by GenePro)), 
GeneMarichmm (manufactured by GenePro), GeneHacker {Protein, Nucieic Acid and Enzyme, 42. 3001-07 (1997)), 
Glimmer (Nuc. Acids. Res., 2$: 544-548 (1998): manufactured by The Institute of Genomic Research), or the like, can 
be used. In using the software, the default (initial setting) parameters are usually used, though the parameters can be 
35 optionally changed. 

[0097] In the above-described comparisons, a computer, such as UNIX, PC, Macintosh, or the like, can be used. 
[0098] Examples of the ORF detenmined by the method of the present invention include ORFs having the nucleotid 
sequences represented by SEO ID N0S:2 to 3501 present in the genome of Corynebacterium glutamicum as repre- 
sented by SEQ ID Nb:1 . In these ORFs, polypeptides having the amino acid sequences represented by SEQ ID NOS: 

40 3502 to 7001 are encoded. 

[0099] The function of an ORF can be detemnined by comparing the identified amino acid sequence of the ORF with 
known homologous sequences using a homology searching software or comparator, such as BLAST, FAST, Smith & 
Watemian (i^eth. Enzym., 164: 765 (1988)) or the like on an amino acid data base, such as Swith-Prot, PIR, GenBank- 
nr-aa, GenPept constituted by protein-encoding domains derived from GenBank data base, OWL or the like. 

^ [01 00] Furthermore, by tiie homology searching, the identity and similarity with the amino acid sequences of known 
proteins can also be analyzed. 

[0101] With, respect of the tenn "identity" used herein, where two polypeptides each having 10 amino acids are 
different in the positions of 3 amino acids, these polypeptides have an identity of 70% with each other. In case wherein 
one of the different 3 amino acids is analogue (for example, leucine and isoleucine), these polypeptides have a similarity 
50 of 80%. 

[0102] As a specific example, Table 1 shows the r gistration numbers in known data bases of sequences which are 
judged as having the highest similarity with the nucleotide sequence of tiie ORF derived from Corynebacterium glutami- 
cum ATCC 13032, genes of these sequenc s, functions of th se genes, and identities thereof compar d with known 
amino aci^l translation sequences. 
55 [0103] Thus, a great number of novel genes derived from corynef orm bacteria can be identified by determining tiie 
full nucleotide sequence of the genome derived from corynefonm bacterium by the means of the present invention. 
Moreover, the function of tiie prot ins encoded by these genes can be determined. Since corynefomi bacteria are 
industrially highly useful microorganisms, many f the identified genes are industrially useful. 
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[0104] Moreover, the characteristics of respective microorganisms can be clarified by classifying the functions thus 
determined. As a result, valuable infomnation in bre ding is obtained. 

[0105] Furthemriore, from th ORF infomnation derived from corynefomfi bacteria, the ORF corresponding to the 
microorganism is prepared and obtained according to th general method as disclosed in Molecular Cloning, 2nd ed. 

5 or the like. Specifically, an oligonucleotide having a nucleotide sequence adjacent to the ORF is synthesized, and the 
ORF can be isolated and obtained using the oligonucleotide as a primer and a chromosome DNA derived from co- 
rynefomi bacteria as a template according to the general PGR cloning technique. Thus obtained ORF sequences 
include polynucleotides comprising the nucleotide sequence represented by any one of SEQ ID N0S:2 to 3501 . 
[0106] The ORF or primer can be prepared using a polypeptide syntfiesizer based on the above sequence informa- 

10 tion. 

[0107] Examples of the polynucleotide of the present invention include a polynucleotide containing the nucleotide 
sequence of the ORF obtained in the above, and a polynucleotide which hybridizes with* the polynucleotide under 
stringent conditions. 

[0108] The polynucleotide of the present Invention can be a single-stranded DNA, a double-stranded DNA and a 

'5 single-stranded RNA, though it is not limited thereto. 

[01 09] The polynucleotide which hybridizes with the polynucleotide containing the nucleotide sequence of the ORF 
obtained in the above under stringent conditions includes a degenerated mutant of the ORF. A degenerated mutant is 
a polynucleotide fragment having a nucleotide sequence which is different from the sequence of the ORF of the present 
invention which encodes the same amino acid sequence by degeneracy of a gene code. 

20 [01 10] Specific examples include a polynucleotide comprising the nucleotide sequence represented by any one of 
SEQ ID N0S:2 to 3431, aind a polynucleotide which hybridizes with the polynucleotide under stringent conditions. 
[01 1 1] A polynucleotide which hybridizes under stringent conditions is a polynucleotide obtained by colony hybridi- 
zation, plaque hybridization, Soutfiem blot hybridization or the like using, as a probe, the polynucleotide having the 
nucleotide sequence of the ORF identified in the above. Specific examples include a polynucleotide which can be 

25 identified by carrying out hybridization at es^'C in the presence of 0.7-1 .0 M NaCI using a filter on which a polynucleotid 
prepared from colonies or plaques is immobilized, and then washing the filter with 0.1x to 2x SSC solution (the com- 
position of Ix SSC contains 150 mM sodium chloride and 16 mM sodium citrate) at 65°C. 

[01 1 2] The hybridization can be earned out in accordance with known methods described in, for example. Molecular 

Cloning, 2nd ed., Cunrent Protocols in Molecular Biology, DNA Cloning 1: Core Techniques, A Practical Approach, 
30 Second Edition, Oxford University (1 995) or tiie like. Specific examples of the polynucleotide which can be hybridized 

include a DNA having a homology of 60% or more, preferably 80% or more, and particulariy preferably 95% or mor . 

with the nucleotide sequence represented by any one of SEQ ID N0:2 to 3431 when calculated using default (initial 

setting) parameters of a homology searching software, such as BLAST, FASTA, Smith-Waterman or the like. 

[01 1 3] Also, the polynucleotide of the present invention includes a polynucleotide encoding a polypeptide comprising 
35 the amino acid sequence represented by any one of SEQ ID NOS:3502 to 6931 and a polynucleotide which hybridizes 

with the polynucleotide under stringent conditions. 

[0114] Furtiiermore, tiie polynucleotide of the present invention includes a polynucleotide which is present in the 5* 
upstream or 3' downstream region of a polynucleotide comprising tiie nucleotide sequence off any one of SEQ ID NOS: 
2 to 3431 in a polynucleotide comprising the nucleotide sequence represented by SEQ ID N0:1, and has an activity 
of regulating an expression of a polypeptide encoded by the polynucleotide. Specitic examples of the polynucleotide 
having an activity of regulating an expression of a polypeptide encoded by the polynucleotide Inciudes'a polynucleotide 
encoding the above described EMF, such as a promoter, an operator, an enhancer, a silencer, a ribosome-binding 
sequence, a transcriptional termination sequence, and the like. 

[0115] The primer used for obtaining the ORF according to ttie above PGR cloning technique includes an oligonu- 
45 cleotide comprising a sequence which is the same as a sequence of 1 0 to 200 continuous nucleotides in the nucleotide 
sequence of the ORF and an adjacent region or an oligonucleotide comprising a sequence which is complementary 
to the oligonucleotide. Specific examples include an oligonucleotide comprising a sequence which is the same as a 
sequence of 10 to 200 continuous nucleotides of the nucleotide sequence represented by any one of SEQ ID N0S:1 
to 3431 , and an oligonucleotide comprising a sequence complementary to the oligonucleotide comprising a sequence 
50 off at least 1 0 to 20 continuous nucleotide off any one off SEQ ID NOS: 1 to 3431 . When the primers are used as a sense 
primer and an antisense primer, th above-described oligonucleotides in which melting temperature (T^^) and the 
number of nucleotides are not significantly different from each other are preferred. 

[01 16] Th oligonucleotide off tiie present invention includes an oligonucleotide comprising a sequence which is th 
same as 10 to 200 continuous nucleotides of the nucleotide sequence represented by any one of SEQ ID N0S:1 to 
55 3431 or an oligonucleotide comprising a sequence complementary to the oligonucleotid . 

[0117] Also, analogues of these oligonucleotides (hereinafter also ref n^ed to as "analogous oligonucleotides") are 
also provided by the present invention and are useful in the methods described herein. 

[0118] Examples off tiie analogous oligonucleotides include analogous oligonucleotides in which a phosphodiester 
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bond in an oligonucleotide is converted to a phosphorothioat bond, analogous oligonucleotides in which a phosphodi- 
ester bond in an oligonucleotide is converted to an N3'-P5' phosphoamidate bond, analogous oiigonucleotid s in which 
ribose and a phosphodiester bond in an oligonucleotide is converted to a peptide nucleic acid bond, analogous oligo- 
nucleotides in which uracil in an oligonucleotide is replaced with C-5 propynyluracil, analogous oligonucleotides in 
which uracil in an oligonucleotide is replaced with 05 thiazoluracil, analogous oligonucleotides in which cytosine in 
an oligonucleotide is replaced with C-5 propynylcytosine. analogous oligonucleotides in which cytosine in an oligonu- 
cleotide is replaced with phenoxazine-modtfied cytosine, analogous oligonucleotides in which ribose in an oligonucle- 
otide Is replaced with 2'-0-propylribose, analogous oligonucleotides in which ribose in an oligonucleotide is replaced 
with 2'-methoxyethoxyribose, and the like (Cell Engineering, 16: 1463 (1997)). 

[0119] The above oligonucleotides and analogous oligonucleotides of the present invention can be used as probes 
for hybridization and antisense nucleic acids described below in addition to as primers. 

[0120] Examples of a primer for the antisense nucleic acid techniques known in the art fnclude an oligonucleotide 
which hybridizes the oligonucleotide of the present invention under stringent conditions and has an activity regulating 
expression of the polypeptide encoded by the polynucleotide, in addition to the above oligonucleotide. 

3. Determination of isozymes 

[0121] Many mutants of coryneform bacteria which are useful in the production of useful substances, such as amino 
acids, nucleic acids, vitamins, saccharides, organic acids, and the like, are obtained by the present invention. 
[0122] However, since the gene sequence data of the microorganism has been, to date, insufficient, useful mutants 
have been obtained by mutagenic techniques using a mutagen, such as nitrosoguanidine (NTG) or the like. 
[0123] Although genes can be mutated randomly by the mutagenic method using the above-described mutagen, all 
genes encoding respective isozymes having similar properties relating to the metabolism of intermediates cannot be 
mutated. In the mutagenic method using a mutagen, genes are mutated randomly. Accordingly, hannful mutations 
worsening culture characteristics, such as delay in growth, accelerated foaming, and the like, might be imparted at a 
great frequency, in a random manner. 

[01 24] However, if gene sequence inf omnation is available, such as is provided by the present invention, it is possible 
to mutate all of the genes encoding target isozymes. In this case, harmful mutations may be avoided and the target 
mutation can be incorporated. 

[0125] Namely, an accurate number and sequence infonmation of the target isozymes in coryneform bacteria can be 
obtained based on the ORF data obtained in the above item 2. By using the sequence information, all of the target 
isozyme genes can be mutated into genes having the desired properties by, for example, the site-specific mutagenesis 
method described in Molecular Cloning, 2nd ed. to obtain useful mutants having elevated productivity of useful sub- 
stances. 

4^ Clarification or detemnination of biosynthesis pathway and signal transmission pathway 

[01 26] Attempts have been made to elucidate biosynthesis pathways and signal transmission pathways in a number 
of organisms, and many findings have been reported. However, there are many unknown aspects of corynefonm bac- 
teria since a number of genes have not been identified so far. 
[01 27] These unknown points can be clarified by the following method. 

[0128] The functional information of ORF derived from corynefomn bacteria as identified by the method of above item 
2 is arranged. The term "arranged" means that the ORF is classified based on the biosynthesis pathway of a substance 
or the signal transmission pathway to which the ORF belongs using known information according to the functional 
information. Next, the arranged ORF sequence information is compared with enzymes on the biosynthesis pathways 
or signal transmission pathways of other known organisms. The resulting infonmation is combined with known data on 
coryneform bacteria. Thus, the biosynthesis pathways and signal transmission pathways in coryneform bacteria, which 
have been unknown so far, can be determined. 

[0129] As a result that these pathways which have been unknown or unclear hitherto are clarified, a useful mutant 
for producing a target useful substance can be efficiently obtained. 

[0130] When the thus clarified pathway is judged as important in the synthesis of a useful product, a useful mutant 
can be obtained by selecting a mutant wherein this pathway has been strengthened. Also, when the thus clarified 
pathway is Judged as not important in the biosynthesis of the target useful product, a useful mutant can be obtained 
by s lecting a mutant wherein the utilization frequency of this pathway is low red. 

5. Clarification or determination of useful mutation point 

[0131] Many useful mutants of coryneform bacteria which are suitable for the production of useful substances, such 
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as amino acids, nucleic acids, vitamins, saccharid s, organic acids, and the lilc , have been obtained. However, it Is 
hardly known which mutation point is imparted to a gen to impr ve the productivity. 

[0132] However, mutation points contained In production strains can be identified by comparing desired sequenc s 
of the genome DNA of the production strains obtained from coryneform bacteria by the mutagenic technique with the 
nucleotide sequences of the corresponding genom DNA and ORF derived from coryneform bacteria detemriined by 
the methods of the above items 1 and 2 and analyzing them 

[0133] Moreover, effective mutation points contributing to the production can be easily specified from among thes 
mutation points on the basis of known infonnation relating to the metabolic pathways, the metabolic regulatory mech- 
anisms, the structure activity correlation of enzymes, and the like. 

[0134] When any efficient mutation can be hardly specified based on known data, the mutation pointe thus identtfi d 
can be introduced into a wild strain of corynefonm bacteria or a production strain free of the mutation. Then, it is exarftined 
whether or not any positive effect can be achieved on the production. 

[0135] For example, by comparing the nucleotide sequence of homoserine dehydrogenase gene horn of a lysine- 
producing B-6 strain of Corynebacterium glutamicum {AppL MicrobioL BiotechnoL, 32, 269-273 (1989)) with the nu- 
cleotide sequence corresponding to the genome of Corynebacterium glutamicum ATCC 1 3032 according to the present 
invention, a mutation of amino acid replacement in which valine at the 59-posltlon is replaced with alanine (Val59Ala) 
was identified. A strain obtained by introducing this mutation into the ATCC 13032 strain by the gene replacement 
method can produce lysine, which indicates that this mutation is an effective mutation contributing to the production 
of lysine. 

[0136] SImilariy, by comparing the nucleotide sequence of pyaivate carboxylase gene pyc of the B-6 strain with the 
nucleotide sequence corresponding to the ATCC 1 3032 genome, a mutation of amino acid replacement in which proline 
at the 458-position was replaced with serine (Pro458Ser) was identified. A strain obtained by introducing this mutation 
into a lysine-producing strain of No. 58 (PERM BP-7134) of Corynebacterium glutamicum Uee of this mutation shows 
an improved lysine productivity In comparison with the No. 58 strain, which indicates that this mutation is an effectiv 
mutation contributing to the production of lysine. 

[0137] In addition, a mutation A1 a21 3Thr in glucose-6-phosphate dehydrogenase was specified as an effective mu- 
tation relating to the production of lysine by detecting glucose-6-phosphate dehydrogenase gene zwf of the B-6 strain. 
[0138] Furthenmore, the lysine-productivity of Corynebacterium glutamicum was improved by replacing the base at 
the 932-position of aspartokinase gene lysC of the Corynebacterium glutamicum ATCC 13032 genome with cytosine 
to thereby replace threonine at the 311 -position by isoleucine, which indicates that this mutation is an effective mutation 
contributing to the production of lysine. 

[01 39] Also, as another method to examine whether or not the identified mutation point is an effective mutation, there 
is a method in which the mutation possessed by the lysine-producing strain is retumed to the sequence of a wild type 
strain by the gene replacement method and whether or not it has a negative influence on the lysine productivity. For 
example, when the amino acid replacement mutation Val59Ala possessed by hom of the lysine-producing B-6 strain 
was retumed to a wild type amino acid sequence, the lysine productivity was lowered in comparison with the B-6 strain. 
Thus, it was found that this mutation is an effective mutation contributing to the production of lysine. 
[01 40] Effective mutation points can be more efficiently and comprehensively extracted by combining, if needed, the 
DNA an^y analysis or proteome analysis described below. 

6. Method of breeding industrially advantageous production strain 

[0141] It has been a general practice to construct production strains, which are used industrially in the fenmentation 
production of the target useful substances, such as amino acids, nucleic acids, vitamins, saccharides, organic acids, 
and the like, by repeating mutiagenesis and breeding based on random mutagenesis using mutagens, such as NTG 
or the like, and screening. 

[0142] In recent years, many examples of improved production strains have been made through the use of recom- 
binant DNA techniques. In breeding, however, most of the parent production strains to be improved are mutants ob- 
tained by a conventional mutagenic procedure (W. Leuchtenberger, Amino Acids - Technica! Production and Use. In: 
Roehr (ed) Biotechnology, second edition, vol. 6, products of primary metabolism. VCH Vertagsgesellschaft mbH, Wein- 
helm, P465 (1996)). 

[0143] Although mutagenesis methods hav largely contributed to the progress of the fermentation industry, they 
suffer from a serious problem of multiple, random introduction of mutations into very part of the chromosome. Since 
many mutations are accumulat d in a single chromosome each time a strain is improved, a production strain obtained 
by the random mutation and selecting is generally inferior in properties (for example, showing poor growth, delay d 
consumption of saccharides, and poor resistance to stresses such as temperaturo and oxygen) to a wild type strain, 
which brings about troubles such as failing to establish a suffici ntly elevat d productivity, being fr quently contami- 
nated with miscellaneous bacteria, requiring troublesome procedures in culture maintenance, and the like, and, in its 
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turn, elevating the production cost in practice. In addition, the Improv m nt in th productivity is based on random 
mutations and thus the mechanism thereof is unclear. Therefore, it is very difficult to plan a rational br eding strategy 
for the subsequent improvement in the productivity. 

[0144] According to the pr sent invention, effective mutation points contributing to th production can be effici ntfy 
5 specified from among many mutation points accumulated in tiie chromosome of a production strain which has be n 
bred from corynefonm bacteria and, therefore, a novel breeding method of assembling these effective mutations in the 
corynefomfi bacteria can be established. Thus, a useful production strain can be reconstructed. It is also possible to 
construct a useful production strain from a wild type strain. 
[0145] Specifically, a useful mutant can be constructed in the following manner. 
10 [01 46] One of the mutation points is incorporated into a wild type strain of corynef orm bacteria. Then, it is examined 
whettier or not a positive effect is established on the production. When a positive effect is obtained, the mutation point 
is saved. When no effect is obtained, the mutation point is removed. Subsequentiy, only a strain having the effective 
mutation point is used as the parent strain, and the same procedure is repeated. In general, the effectiveness of a 
mutation positioned upstream cannot beclearty evaluated in some cases when there is a rate-determining point in the 
15 downstream of a biosynthesis pathway. It is Uierefore preferred to successively evaluate mutation points upward from 
downstream. 

[01 47] By reconstituting effective mutations by the metiiod as described above in a wild type strain or a strain which 
has a high growth speed or the same ability to consume saccharides as tiie wild type strain, it Is possible to construct 
an industrially advantageous strain which is free of troubles in the previous methods as described above and to conduct 

20 fenmentation production using such strains within a short time or at a higher temperature. 

[0148] For example, a lysine-producing mutant B-6 (AppL Microbiol Biotechnot,, 32, 262-273 (1989)). which is ob- 
tained by multiple rounds of random mutagenesis from a wild type strain Corynebacterium giutamicum ATCC 13032, 
enables lysine femnentation to be performed at a temperature between 30 and 34**C but shows lowered growth and 
lysine productivity at a temperature exceeding 34^*0. Therefore, the fermentation temperature should be maintained 

25 at 34°C or lower. In contrast thereto, the production strain described in the above item 5, which is obtained by recon- 
stituting effective mutations relating to lysine production, can achieve a productivity at 40 to 42^C equal or superior to 
the result obtained by culturing at 30 to 34''C. Therefore, this strain is industrially advantageous since it can save the 
load of cooling during the fermentation. 

[0149] When culture should be carried out at a high temperature exceeding 43''C, a production strain capable of 
30 conducting fermentation production at a high temperature exceeding 43''C can be obtained by reconstituting useful 
mutations in a microorganism belonging to the genus CorynebacteriumyNhlch can grow at high temperature exceeding 
43*C. Examples of the microorganism capable of growing at a high temperature exceeding 43*'C include Corynebac- 
terium thermoaminogenes, such as Corynebactenum thermoaminogenes PERM 9244, PERM 9245, PERM 9246 and 
PERM 9247. 

35 [0150] A strain having a further improved productivity of the target product can be obtained using the thus recon- 
structed strain as the parent strain and furttier breeding it using the conventional mutagenesis metiiod, tiie gene am- 
plification method, the gene replacement metiiod using the recombinant DNA technique, the transduction method or 
the cell fusion method. Accordingly, the microorganism of the present Invention includes, but Is not limited to, a mutant, 
a cell fusion strain, a transformant, a transductant or a recombinant strain constructed by using recombinant DNA 

40 techniques, so long as it is a producing strain obtained via ttie step of accumulating at least two effective mutations in 
a corynefonm bacteria in the course of breeding. 

[0151] When a mutation point judged as being harmful to the growUi or production is specified, on the other hand, 
it is examined whether or not the producing strain used at present contains the mutation point. When it has the mutation, 
it can be retumed to the wild type gene and thus a further useful production strain can be bred. 
45 [0152] The breeding method as described above is applicable to microorganisms, other than corynefonn bacteria, 
which have industrially advantageous properties (for example, microorganisms capable of quickly utilizing less expen- 
sive cartx>n sources, microorganisms capable of growing at higher temperatures). 

7. Production and utilization of polynucleotide array 

50 

(1) Production of polynuci otide anray 

[0153] A polynucleotide array can be produced using the polynucleotide or oligonucleotide of the present inv ntion 
obtained in tiie abov items 1 and 2. 
55 [01 54] Examples include a polynucleotide an^ay comprising a solid support to which at least one of a polynucleotid 
comprising the nucleotide sequence represented by SEQ ID N0S:2 to 3501. a polynucleotide which hybridizes witti 
the polynucleotide under string nt conditions, and a polynucleotide comprising 10 to 200 continuous nucleotides in 
the nucleotide sequence of the polynucleotid is adhered; and a polynucleotide anBy comprising a solid support to 
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which at least one of a p lynucleotide encoding a polypeptid comprising the amino acid sequenc represented by 
any one of SEQ ID NOS:3502 to 7001, a polynucleotide which hybridizes with the polynucleotld under stringejit 
conditions, and a polynucleotide comprising 10 to 200 continuous bas s in the nucleotide sequences of the polynu- 
cleotides is adh red. 

5 [0155] Polynucleotld anays of th present invention include substrates Icnown in the art, such as a DNA chip, a 
DMA microanray and a DNA macroarray, and the like, and comprises a solid support and plural polynucleotides or 
fragments thereof which are adhered to the surface of the solid support. 
[0156] Examples of the solid support include a glass plate, a nylon membrane, and the like. 
[0157] The polynucleotides or fragments thereof adhered to the surface of the solid support can be adhered to th 

10 surface of the solid support using the general technique for preparing arrays. Namely, a method in which they are 
adhered to a chemically surface-treated solid support, for example, to which a polycation such as polylysine or the like 
has been adhered {Nat Genet, 21: 15-19 (1999)). The chemically surface-treated supports ace commercially availabi 
and the commercially available solid product can be used as the solid support of the polynucleotide array according 
to the present invention. 

15 [01 58] As the polynucleotides or oligonucleotides adhered to the solid support, the polynucleotides and oligonucle- 
otides of the present invention obtained in the above items 1 and 2 can be used. 

[01 59] The analysis described below can be efficiently perf omned by adhering the polynucleotides or oligonucleotides 

to the solid support at a high density, though a high fixation density is not always necessary. 

[0160] Apparatus for achieving a high fixation density, such as an arrayer robot or the like, is commercially available 

20 from Takara Shuzo (GMS417 Arrayer), and the commercially available product can be used. 

[0161] Also, the oligonucleotides of the present invention can be synthesized directly on the solid support by the 
photolithography method or the like (Nat Genet, 21: 20-24 (1999)). In this method, a linker having a protective group 
which can be removed by light irradiation is first adhered to a solid support, such as a slide glass or the like. Then, it 
is in^adiated with light through a mask (a photolithograph mask) pemneating light exclusively at a definite part of th 

25 adhesion part. Next, an oligonucleotide having a protective group which can be removed by light inradiation is added 
to the part. Thus, a ligation reaction with the nucleotide arises exclusively at the irradiated part. By repeating this 
procedure, oligonucleotides, each having a desired sequence, different from each other can be synthesized in respec- 
tive parts. Usually, the oligonucleotides to be synthesized have a length of 10 to 30 nucleotides. 

30 (2) Use of polynucleotide anray 

[01 62] The following procedures (a) and (b) can be camed out using the polynucleotide an^ay prepared In the abov 
(1). 

35 (a) Identification of mutation point of coryneform bacterium mutant and analysis of expression amount and expression 
profile of gene encoded by genome 

[0163] By subjecting a gene derived from a mutant of coryneform bacteria or an examined gene to the following 
steps (i) to (iv), the niutation point of the gene can be identified or the expression amount and expression profile of th 
40 gene can be analyzed: 

(i) producing a polynucleotide array by the method of the above (1); 

(ii) incubatingpolynucleotides immobilized on the polynucleotide array together with the labeled gene derived from 
a mutant of the coiynefonm bacterium using the polynucleotide array produced in the above (i) under hybridization 

45 conditions; 

(iii) detecting the hybridization; and 

(iv) analyzing the hybridization data. 

[0164] The gene derived frorii a mutant of coryneform bacteria or the examined gene include a gene relating to 
50 biosynthesis of at least one selected from amino acids, nucleic acids, vitamins, saccharides, organic acids, and ana- 
logues thereof. 

[01 65] The method will be described in detail. 

[0166] A single nucleotide polymorphism (SNP) in a human region of 2,300 kb has been identified using polynucle- 
otide arrays (Science, 280: 1077-82 (1998)). In accordance with th method of identifying SNP and methods describ d 
55 in Science, 27a 680-686 (1997); Proc. NatL Acad, Sci. USA, 98: 12833-38 (1999); Science, 284: 1520-23 (1999), and 
the like using the polynucleotide an^y produced in the abov (1 ) and a nucleic acid molecule (DNA, RNA) derived from 
coryneform bacteria in the method of th hybridization, a mutation point of a useful mutant, which is us ful in producing 
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, or the like can be identifi d and the gen 
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expression amount and the expression profile thereof can be analyzed. 

[0167] The nucleic acid molecule (DNA, RNA) derived from th coryneform bacteria can be obtained according, to 
th general method described in Molecular Cloning, 2nd ed. or the like, mRN A derived from Corynebacterium glutami- 
cum can also be obtained by the method of Bormann et al. (Molecular Microbiology, ft 31 7-326 (1 992)) or the like. 
s [01 68] Although ribosomat RNA (rRNA) is usually obtained in large exc ss in addition to the target mRNA, the anal- 
ysis is not seriously disturbed thereby. 

[0169] The resulting nucleic acid molecule derived from coryneform bacteria is labeled. Labeling can be earned out 
according to a method using a fluorescent dye, a method using a radioisotope or the like. 

[0170] Specific examples include a labeling method in which psoralen-biotin is crosslinked with RNA extracted from 
10 a microorganism and, after hybridization reaction, a fluorescent dye having streptoavidin bound thereto is bound to 
the biotin moiety {Nat BiotechnoL, ie, 45-48 (1998)); a labeling method in which a reverse transcription reaction is 
eam'ed out using RNA extracted from a microorganism as a template and random primers as'primers, and dUTP having 
a fluorescent dye (for example, Cy3, Cy5) (manufactured by Amersham Pharmacia Biotech) is incorporated into cDNA 
(P/w. Natl. Acad, Sci. USA, 9a 12833-38 (1999)); and the like. 
15 [0171] The labeling specificity can be improved by replacing the random primers by sequences complementary to 
the 3'-end of ORF (J. BacterioL, 181: 6425-40 (1999)). 

[0172] In the hybridization method, the hybridization and subsequent washing can be canied out by the general 
method {Nat BioctechnoL, 14: 1675-80 (1996), or the like). 

[0173] Subsequently, the hybridization intensity is measured depending on the hybridization amount of the nucleic 
^ acid molecule used in the labeling. Thus, the mutation point can be identified and the expression amount of the gen 
can be calculated. 

[01 74] The hybridization intensity can be measured by visualizing the fluorescent signal, radioactivity, luminescence 
dose, and the like, using a laser confocal microscope, a CCD camera, a radiation imaging device (for example, STORM 
manufactured by Amersham Phamnacia Biotech), and the like, and then quantifying the thus visualized data. 
25 [01 75] A polynucleotide an^ay on a solid support can also be analyzed and quantified using a commercially available 
apparatus, such as GMS418 Anray Scanner (manufactured by Takara Shuzo) or the like. 

[01 76] The gene expression amount can be analyzed using a commercially available software (for example, ImaGene 
manufactured by Takara Shuzo; Array Gauge manufactured by Fuji Photo Film; ImageQuant manufactured by Amer- 
sham Phanmacia Biotech, or the like). 
30 [0177] A fluctuation in the expression amount of a specific gene can be monitored using a nucleic acid molecule 
obtained in the time course of culture as the nucleic acid molecule derived from coryneform bacteria. The culture 
conditions can be optimized by analyzing the fluctuation. 

[0178] The expression profile of the microorganism at the total gene level (namely, which genes among a great 
number of genes encoded by the genome have been expressed and the expression ratio thereoQ can be determined 
35 using a nucleic acid molecule having the sequences of many genes determined from the full genome sequence of the 
microorganism. Thus, the expression amount of the genes determined by the full genome sequence can be analyzed 
and, In its turn, the biological conditions of the microorganism can be recognized as the expression pattern at the full 
gene level. 

^ (b) Confirmation of the presence of gene homologous to examined gene in coryneform bacteria 

[0179] Whether or not a gene homologous to the examined gene, which is present in an organism other than co- 
ryneform bacteria, is present In corynefomi bacteria can be detected using the polynucleotide anBy prepared in the 
above (1). 

45 [0180] This detection can be carried out by a method in which an examined gene which is present in an organism 
other than corynefomn bacteria is used instead of the nucleic acid molecule derived from corynefomn bacteria used in 
the above identification/analysis method of (1). 

8. Recording medium storing full genome nucleotide sequence and ORF data and being readable by a computer and 
50 methods for using the same 

[01 81 ] The term "recording medium or storage devic which is readable by a computer" means a recording medium 
or storage medium which can be directly readout and accessed with a computer. Examples include magnetic recording 
media, such as a floppy disk, a hard disk, a magn tic tape, and the like; optical recording m dia, such as CD-ROM, 
55 CD-R, CD-RW, DVD-ROM, DVD-RAM, DVD-RW, and the like; electric recording media, such as RAM, ROM, and the 
like; arid hybrids in these categories (for example, magnetic/optical recording m dia, such as MO and the lik ). 
[01 82] Instalments for recording or inputting in or on the recording medium r instruments or devices for reading out 
the infomnation in the recording medium can b appropriately selected, depending on the typ of the r cording medium 
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and th access d vice utilized. Also, various data processing progranis, software, comparator and formats are used 
for recording and utilizing the polynucleotide sequence infomnation or the like, of the present invention in the recording 
medium. The information can be expressed in the form of a binary file, a text file or an ASCII file formatted with com- 
mercially available software, for example. Moreover, software for accessing the sequenc information is available and 

s known to one of ordinary skill in the art 

[01 83] Examples of the information to be recorded in the above-described medium include the full genome nucleotide 
sequence infonmation of coryneform bacteria as obtained in the above item 2, the nucleotide sequence information of 
ORF, the amino acid sequence information encoded by the ORF, and the functional information of polynucleotides 
coding for the amino acid sequences. 

^0 [01 84] The recording medium or storage device which is readable by a cornputer according to the present invention 
refers to a medium In which the information of the present invention has been recorded. Examples include recording 
media or storage devices which are readable by a computer storing the nucleotide sequence information represent d 
by SEQ ID N0S:1 to 3501, the amino acid sequence infomnation represented by SEQ ID NOS:3502 to 7001, the 
functional information of the nucleotide sequences represented by SEQ ID N0S:1 to 3501 , the functional information 

^5 of the amino acid sequences represented by SEQ ID NOS:3502 to 7001, and the infomnation listed in Table 1 below 
and the like. 

9. System based on a computer using the recording medium of the present invention which is readable by a comput r 

20 [0185] The term "system based on a computer" as used herein refers a system composed of hardware device(s), 
software device(s), and data recording device(s) which are used for analyzing the data recorded In the recording me- 
dium of the present invention which is readable by a computer. 

[01 86] The hardware device(s) are, for example, composed of an input unit, a data recording unit, a central processing 
unit and an output unit collectively or individually 

25 [0187] By the software device(s), the data recorded in the recording medium of the present invention are searched 
or analyzed using the recorded data and the hardware device(s) as described herein. Specifically, the software device 
(s) contain at least one program which acts on or with the system in order to screen, analyze or compare biologically 
meaningful structures or infonmation from the nucleotide sequences, amino acid sequences and the like recorded in 
the recording medium according to the present invention. 

30 [0188] Examples of the software device(s) for identifying ORF and EMF domains include GeneMark {Nuc. Acids. 
Res,, 22: 4766-67 (1994)), GeneHacker (Prote/n, Nucieic Acid and Enzyme, 42. 3001-07 (1997)). Glimmer (The Insti- 
tute of Genomic Research; Nuc. Acids, Res., 26: 544-548 (1998)) and the like. In the process of using such a software 
device, the default (initial setting) parameters are usually used, although the parameters can be changed, if necessary, 
in a manner known to one of ordinary skill in the art. 

^ [0189] Examples of the software device(s) for identifying a genome domain or a polypeptide domain analogous to 
the target sequence or the target stmctural motif (homology searching) include FASTA, BLAST, Smith-Waterman, 
GenetyxMac (manufactured by Software Development), GCG Package (manufactured by Genetic Computer Group), 
GenCore (manufactured by Compugen), and the like. In the process of using such a software device, the default (initial 
setting) parameters are usually used, although the parameters can be changed, if necessary, in a manner known to 

^ one of ordinary skill in the art. 

[0190] Such a recording medium storing the full genome sequence data is useful in preparing a polynucleotide array 
by which the expression amount of a gene encoded by the genome DNA of corynefomn bacteria and the expression 
profile at the total gene level of the microorganism, namely, which genes among many genes encoded by the genome 
have been expressed and the expression ratio thereof, can be determined. 

^ [0191] The data recording device(s) provided by the present invention are, for example, memory device(s) for re- 
cording the data recorded in the recording medium of the present invention and target sequence or target stmctural 
motif data, or the like, and a memory accessing device(s) for accessing the same. 

[0192] Namely, the system based on a computer according to the present invention comprises the following: 

so (i) a user input device that inputs the information stored in the recording medium of the present invention, and 

target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input infomnation; 

(iii) a comparator that compares the information stored in the recording medium of the present invention with the 
target sequenc or targ t staicture motif infomnation, recorded by the data storing device of (ii) for screening and 

55 analyzing nucleotide sequence information which is coincident with or analogous to the target sequence or target 

structure motif information; and 

(iv) an output d vice that shows a screening or analyzing result obtained by the comparator. 
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[01 93] This system is usable in the methods in items 2 to 5 as descril>ed above for searching and analyzing the ORF 
and EMF domains, target sequence, target structural motif, etc. of a coryneform bacterium, searching homologs, 
searching and analyzing isozymes, d termining th biosynthesis pathway and the signal transmission pathway, and 
identifying spots which have been found in the proteome analysis. The tenn "homologs" as used herein includes both 
of orthologs and paralogs. 

10. Production of polypeptide using ORF derived from corynefomn bacteria 

[01 94] The polypeptide of the present invention can be produced using a polynucleotide comprising the ORF obtained 
in the above item 2. Specifically, the polypeptide of the present invention can be produced by expressing the polynu- 
cleotide of the present invention or a fragment thereof in a host cell, using the method described in Molecular Cloning, 
2nd ed.. Current Protocols in Molecular Biology, and the like, for example, according to the' following method. 
[0195] A DNA fragment having a suitable length containing a part encoding the polypeptide is prepared from the full 
length ORF sequence, if necessary. 

[0196] Also, DNA in which nucleotides in a nucleotide sequence at a part encoding the polypeptide of the present 
invention are replaced to give a codon suitable for expression of the host cell, if necessary. The DNA is useful for 
efficiently producing the polypeptide of the present invention. 

[0197] A recombinant vector is prepared by inserting the DNA fragment into the downstream of a promoter in a 
suitable expression vector. 

[0198] The recombinant vector is introduced to a host cell suitable for the expression vector. 

[0199] Any of bacteria, yeasts, animal cells. Insect cells, plant cells, and the like can be used as the host cell so long 

as it can be expressed in the gene of interest. 

[0200] Examples of the expression vector include those which can replicate autonomously in the above-described 
host cell or can be integrated into chromosome and have a promoter at such a position that the DNA encoding the 
polypeptide of the present Invention can be transcribed. 

[0201] When a procaryote cell, such as a bacterium or the like, is used as the host cell, it is preferred that the 
recombinant vector containing the DNA encoding the polypeptide of the present invention can replicate autonomously 
in the bacterium and is a recombinant vector constituted by, at least a promoter, a ribosome binding sequence, th 
DNA of the present invention and a transcription tenmination sequence. A promoter controlling gene can also be con- 
tained therewith in operable combination. 

[0202] Examples of the expression vectors include a vector plasmid which is replicable in Corynebacterium glutami- 
cum, such as pCGI (Japanese Published Unexamined Patent Application No. 1 34500/82). pCG2 (Japanese Published 
Unexamined Patent Application No. 35197/83), pCG4 (Japanese Published Unexamined Patent /Application No. 
183799/82), pCGII (Japanese Published Unexamined Patent Application No. 134500/82). pCGIIG. pGE54 and 
pCB101 (Japanese Published Unexamined Patent Application No. 105999/83), pCE51, pCE52 and pCE53 {MoL Gen, 
Genet, 196: 175-178 (1984)), and the like; a vector plasmid which is replicable in Escherichia coll, such as pET3 and 
pETII (manufactured by Stratagene). pBAD. pThioHis and pTrcHis (manufactured by Invitrogen), pKK223-3 and 
pGEX2T (manufactured by Amersham Phamnacia Biotech), and the like; and pBTrp2, pBTacI and pBTac2 (manufac- 
tured by Boehringer Mannheim Co.), pSE280 (manufactured by Invitrogen), pGEMEX-1 (manufactured by Promega), 
pQE-8 (manufactured by QIAGEN). pKYPIO (Japanese Published Unexamined Patent Application No. 110600/83), 
pKYP200 {Agric. Biol. Chem., 48: 669 (1984)), pLSAI {Agric. BioL Chem., 53: 277 (1989)), pGELI (Proc. Natl, Acad. 
Sci. USA, 82. 4306 (1985)), pBluescript II SK(-) (manufactured by Stratagene), pTrs30 (prepared from Escherichia coli 
JM109/pTrS30 (PERM BP-5407)), pTrs32 (prepared from Escherichia cotf JM109/pTrS32 (PERM BP-5408)), pGH/V2 
(prepared from Escherichia coli IGHA2 (PERM B-400), Japanese Published Unexamined Patent /Application No. 
221091/85), pGKA2 (preparedfrom Escherichia co//IGK/\2(FERM BP-6798), Japanese Published Unexamined Patent 
Application No. 221091/85). pTemri2 (U.S. Patents 4,686,191, 4.939,094 and 5,160.735), pSupex, pUBIIO, pTP5, 
pCI 94 and pEG400 (J. BacterioL, 172. 2392 (1 990)), pGEX (manufactured by Pharmacia), pET system (manufactured 
by Novagen). and the like. 

[0203] Any promoter can be used so long as it can function in the host cell. Examples include promoters derived 
from Escherichia coli, phage and the like, such as trp promoter (P^^) , lac promoter, Pl promoter, Pp promoter, T7 
promoter and the like. Also, artificially designed and modified promoters, such as a promoter in which two Pfrp are 
linked in series (P+^pX2) , tac promoter, /acT7 promoter /efl promoter and th like, can be used. 
[0204] It is preferred to use a plasmid in which the space between Shine-Dalgamo sequenc which is th ribosom 
binding sejquence and the initiation codon is adjusted to an appropriate distance (for xample. 6 to 18 nucleotides). 
[0205] The transcription tennination sequence is not always necessary for the expression of the DNA of the present 
invention. However, it is preferred to arrange the transcription temriinating sequence at just downstream of the structural 
gene. 

[0206] One of ordinary skill in the art will appreciate that the codons of the above-descn'bed el ments may be opti- 
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mized, in a known manner, depending on the host cells and environmental conditions utiliz d. 
[0207] Examples of the host c II include microorganisms belonging to th genus Escherichia, the genus Serratia, 
the genus Bac///a5, the g ms BrevibacteriumA^QqemsCorynebacteriumAh g r\us Micmbacterium,Xh genus Pseu- 
domonas, and the like. Specific examples include Escherichia co/y'XLI-Blue, Escherichia coii XLZ-Blue, Escherichia 
CO// DH1, Escherichia co// MC1000, Escherichia co//KY3276, Escherichia co//W1485, Escherichia co// JM109, Es- 
cherichiacoii HB^OI, Escherichia coiitio. 49, Escherichia coli\N3'iW, Escherichia coH hiY49, Escherichia cotiG\edB, 
Escherichia co// TB1, Serratia ficaria, Serratia fonticoia, Serratia iiquefaciens, Serratia marcescens, Baciilus subtilis, 
Bacillus amyioiiquefaciens, Corynebacterium ammonia genes, Brevibacterium ImmariophilumATCC 14068, Brevibac- 
terium saccharoiyticum ATCC 14066, C^rynebacteriumglutamicumAJCC 13032, Co/yne&acfe/7u/n glutamicumATCC 
1 3869, Corynebacteriumglutamicum fiJOC 1 4067 (prior genus and species: Brevibacterium flavum), Corynebacterium 
glutamicum fiJCC 13869 (prior genus and species: Brevibacterium lactofermentum, or Corynebacterium lactofermen- 
turn), Corynebacterium acetoacidophitum PJCC 13870, Corynebacterium thermoamlnogeries PERM 9244, Microbac- 
terium ammoniaphilum ATCC 1 5354, Pseudomonas putida, Pseudomonas sp. D-01 1 0, and the like. 
[0208] When Corynebacterium glutamicum or an analogous microorganism is used as a host, an EMF necessary 
for expressing the polypeptide is not always contained in the vector so long as the polynucleotide of the present in- 
vention contains an EMF. When the EMF is not contained in the polynucleotide, it is necessary to prepare the EMF 
separately and llgate it so as to be in operable combination. Also, when a higher expression amount or specific ex- 
pression regulation is necessary, it is necessary to ligate the EMF con^esponding thereto so as to put the EMF in 
operable combination with the polynucleotide. Examples of using an externally ligated EMF are disclosed in Microbi- 
ology, 142, 1 297-1 309 (1 996). 

[0209] With regard to the method for the introduction of the recombinant vector, any method for introducing DNA into 
the above-described host cells, such as a method in which a calcium ion is used (Proa Natl. Acad. Sci. USA, 6^ 2110 
(1972)), a protoplast method (Japanese Published Unexamined Patent Application No. 2483942/88), the methods 
described in Gene, IT. 107 (1982) and Molecular & General Genetics, 168: 111 (1979) and the like, can be used. 
[0210] When yeast is used as the host cell, examples of the expression vector include pYES2 (manufactured by 
Invitrogen), YEp13 (ATCC 37115), YEp24 (ATCC 37051), YCp50 (ATCC 37419), pHS19, pHSIS, and the like. 
[0211] Any promoter can be used so long as it can be expressed In yeast. Examples include a promoter of a gene 
in the glycolytic pathway, such as hexose kinase and the like, PH05 promoter, PGK promoter, GAP promoter, ADH 
promoter, gal 1 promoter, gal 10 promoter, a heat shock protein promoter, MF al promoter, CUP 1 promoter, and the like. 
[0212] Examples of the host cell include microorganisms belonging to the genus Saccharomyces, the genus 
Schizosaccharomyces, the genus Kluyveromyces, the genus Trichosporon, the genus Schwanniomyces, the genus 
Pichia, the genus Candida and the like. Specific examples include Saccharomyces cerevisiae, Schizosaccharomyces 
pombe, Kluyveromyces lactis, Trichosporon pullulans, Schwanniomyces alluvius, Candida utillsand the like. 
[021 3] With regard to the method for the introduction of the recombinant vector, any method for introducing DNA into 
yeast, such as an electroporation method {Methods. Enzymol., 194: 182 (1990)), a spheroplast method {Proc. Natl. 
Acad. Sci. USA, 75: 1929 (1978)), a lithium acetate method (J. Bacterid., 153: 163 (1983)), a method described in 
Proc. Natl. Acad. ScL USA, 75: 1929 (1978) and the like, can be used. 

[021 4] When animal cells are used as the host ceils, examples of the expression vector include pcDNA3. 1 , pSinRepS 
and pCEP4 (manufactured by Invitorogen), pRev-Tre (manufactured by Clontech), pAxCAwt (manufactured by Takara 
Shuzo), pcDNAl and pcDM8 (manufactured by Funakoshi), pAGE107 (Japanese Published Unexamined Patent Ap- 
plication No. 22979/91; Cytotechnotogy, 3:133 (1990)), pAS3-3 (Japanese Published Unexamined Patent Application 
No. 227075/90), pcDM8 (Nature, 329: 840 (1987)), pcDNAI/Amp (manufactured by Invitrogen), pREP4 (manufactured 
by Invitrogen), pAGE103 (J. Biochem., 101: 1307 (1987)). pAGE210, and the like. 

[0215] Any promoter can be used so long as it can function in animal cells. Examples include a promoter of IE 
(immediate early) gene of cytomegalovirus (CMV), an early promoter of SV40, a promoter of retrovirus, a metal- 
lothionein promoter, a heat shock promoter, SRa promoter, and the like. Also, the enhancer of the IE gene of human 
CMV can be used together with the promoter. 

[0216] Examples of the host cell include human Namalwa cell, monkey COS cell. Chinese hamster CHO cell, 
HST5637 (Japanese Published Unexamined Patent Application No. 299/88), and the like. 

[0217] The method for introduction of the recombinant vector into animal cells is not particularly limited, so long as 
it is the general method for introducing DNA Into animal cells, such as an electroporation method (Cytotechnotogy, 3. 
133 (1990)), a calcium phosphate method (Japanese Published Unexamined Patent Application No. 227075/90), a 
lipofection method (Proc. Natl, Acad. ScL USA, 84, 7413 (1987)), the method described in Virology, 52. 456 (1973), 
and the like. 

[021 8] When insect cells are used as tiie host cells, the polypeptide can be expressed, for example, by the method 
described in Bacurovirus Expression Vectors, A Laboratory Manual, W.H. Fr eman and Company, New York (1992), 
Bio/Technology, 8. 47 (1988), or the like. 

[021 9] Specifically, a recombinant gen transfer vector and bacurovims are simultaneously inserted into insect cells 
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to obtain a recombinant virus in an insect c II culture supernatant, and then the insect c its are infected with the resulting 
recombinant virus to e)q3ress the polypeptide. 

[0220] Examples of the gene introducing vector used in th method Include pBlueBac4.5. pVL1392. pVL1393 and 
pBlueBaclll (manufactured by Invltrogen), and the like. 

[0221] Examples of the bacurovirus includ Autographa calif omica nuclear polyhedrosis vims with which insects of 
the family Bamthra are infected, and the like. 

[0222] Examples of the insect cells Include Spodoptera frugiperda oocytes Sf9 and Sf21 (Bacurovirus Expression 
Vectors, A Laboratory Mar)usd, W.H. Freeman and Company, New York (1992)), Tnchopiusia n/ oocyte High 5 (manu- 
factured by Invitrogen) and the like. 

[0223] The method for simultaneously incorporating the above-descrlt>ed recombinant gene transfer vector and the 
eibove-described bacurovirus for the preparation of the recombinant virus include calcium phosphate method (Japanese 
Published Unexamined Patent Application No. 227075/90), lipof action method (Proa Nati'. Acad, Sci. USA, 84: 7413 
(1987)) and the like. 

[0224] When plant cells are used as the host cells, examples of expression vector include a Tl plasmid, a tobacco 
mosaic virus vector, and the like. 

[0225] Any promoter can be used so long as it can be expressed in plant cells. Examples include 35S promoter of 
cauliflower mosaic virus (CaMV), rice actin 1 promoter, and the like. 

[0226] Examples of the host cells include plant cells and the like, such as tobacco, potato, tomato, carrot, soybean, 
rape, alfalfa, rice, wheat, barley, and the like. 

[0227] The method for introducing the recombinant vector is not particularly limited, so long as it is the general method 
for introducing DNA into plant cells, such as the Agrobacterium method (Japanese Published Unexamined Patent 
Application No. 140886/84. Japanese Published Unexamined Patent Application No. 70080/85, WO 94/00977), the 
electroporation method (Japanese Published Unexamined Patent Application No. 251887/85), the particle gun method 
(Japanese Patents 2606856 and 251 781 3), and the like. 

[0228] The transfonmant of the present invention includes a transfonnant containing the polypeptide of the pres nt 
invention per se rather than as a recombinant vector, that Is, a transf ormant containing the polypeptide of the pres nt 
invention which is integrated into a chromosome of the host, in addition to the transfonnant containing the abov 
recombinant vector. 

[0229] When expressed in yeasts, animal cells, insect cells or plant cells, a glycopolypeptide or glycosylated polyp p- 
tide can be obtained. 

[0230] The polypeptide can be produced by culturing the thus obtained transformant of the present invention in a 
culture medium to produce and accumulate the polypeptide of the present invention or any polypeptide expressed 
under the control of an EMF of the present invention, and recovering the polypeptide from the culture. 
[0231] Culturing of the transfonnant of the present invention in a culture medium is canied out according to the 
conventional method as used in culturing of the host. 

[0232] When the transformant of the present invention is obtained using a prokaryote. such as Escherichia coli or 
the like, or a eukaryote. such as yeast or the like, as ttie host, the transformant is cultured. 
[0233] Any of a natural medium and a synthetic medium can be used, so long as it contains a cari[}on source, a 
nitrogen source, an inorganic salt and the like which can be assimilated by the transformant and can perfonn culturing 
of the transformant efficiently. 

[0234] Examples of the cari^on source include those which can be assimilated by the transformant, such as carbo- 
hydrates (for example, glucose, f mctose. sucrose, molasses containing them, starch, starch hydrolysate. and the like), 
organic acids (for example, acetic acid, propionic acid, and the like), and alcohols (for example, ethanol, propanol, and 
the like). 

[0235] Examples of the nitrogen source include ammonia, various ammonium salts of inorganic acids or organic 
acids (for example, ammonium chloride, ammonium sulfate, ammonium acetate, ammonium phosphate, and the like), 
other nitrogen-containing compounds, peptone, meat extract, yeast extract, com steep liquor, casein hydrolysate, soy- 
bean meal and soybean meal hydrolysate, various femnented cells and hydrolysates thereof, and the like. 
[0236] Examples of inorganic salt include potassium dihydrogen phosphate, dipotassium hydrogen phosphate, mag- 
nesium phoisphate, magnesium sulfate, sodium chloride, ferrous sulfate, manganese sulfate, copper sulfate, calcium 
carix)nat , and the like. 

[0237] The culturing is carried out under aerobic conditions by shaking culture, submerged-a ration stining culture 
or the like. The culturing temperatur is preferably from 15 to 40*'C, and the culturing time is generally from 16 hours 
to 7 days._The pH of the medium is preferably maintained at 3.0 to 9.0 during th culturing. The pH can be adjusted 
using an inorganic or organic acid, an alkali solution, urea, calcium cartx)nate, ammonia, or the like. 
[0238] Also, antibiotics, such as ampfcillin, tetracycline, and the Ilk , can be add d to the medium during the culturing, 
if necessary. 

[0239] When a microorganism transformed with a recombinant vector containing an inducible promoter is cultured, 
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an inducer can be added to th medium, if necessary. 

[0240] For example, isopropyl-p-D-thiogalactopyranoside (IPTG) or tlie like can be added to the medium wheo a 
microorganism transformed with a recombinant vector containing lac promoter is cultured, or indoleacrylic acid (lAA) 
or the like can by added ther to when a microorganism transformed with an expression vector containing trp promoter 
5 is cultured. 

[0241] Examples of the medium used in culturing a transfomnant obtained using animal cells as the host cells include 
RPM1 1640 medium {The Journal of the American Medical Assodation, 199: 619 (1967)), Eagle's MEM medium {Sci- 
ence, 122: 501 (1952)), Dulbecco's modified MEM medium {Virology, ft 396 (1959)), 199 Medium {Proceeding of the 
Sodety for the Biological Medicine, 7a 1 (1 950)), the above-described media to which fetal calf semm has been added, 
10 and the like. 

[0242] The culturing is carried out generally at a pH of 6 to 8 and a temperature of 30 to 40^C in the presence of 5% 
CO2 for 1 to 7 days. 

[0243] Also, if necessary, antibiotics, such as kanamycin, penicillin, and the like, can be added to the medium during 
the culturing. 

15 [0244] Examples of the medium used in culturing a transfomnant obtained using insect cells as the host cells include 
TNM-FH medium (manufactured by Phanmingen), Sf-900 1 1 SFM (manufactured by Life Technologies), ExCell 400 and 
ExCell 405 (manufactured by JRH Biosciences), Grace's Insect Medium (Nature, 195: 788 (1962)), and the like. 
[0245] The culturing is cam'ed out generally at a pH of 6 to 7 and a temperature of 25 to 30*0 for 1 to 5 days. 
[0246] Additionally, antibiotics, such as gentamicin and the like, can be added to the medium during the culturing, if 

20 necessary. 

[0247] A transformant obtained by using a plant cell as the host cell can be used as the cell or after differentiating 
to a plant cell or organ. Examples of the medium used in the culturing of the transfomnant include Murashige and Skoog 
(MS) medium, White medium, media to which a plant homnone, such as auxin, cytokinine, or the like has been added, 
and the like. 

25 [0248] The culturing is canied out generally at a pH of 5 to 9 and a temperature of 20 to 4(PC for 3 to 60 days. 

[0249] Also, antibiotics, such as kanamycin, hygromycin and the like, can be added to the medium during the cul- 
turing, if necessary. 

[0250] As described above, the polypeptide can be produced by culturing a transformant derived from a microor- 
ganism, animal cell or plant cell containing a recombinant vector to which a DNA encoding the polypeptide of the 
50 present invention has been inserted according to the general culturing method to produce and accumulate the polypep- 
tide, and recovering the polypeptide from the culture. 

[0251] The process of gene expression may include secretion of the encoded protein production or fusion protein 
expression and the like in accordance with the methods described in Molecular Cloning, 2nd ed., in addition to direct 
expression. 

35 [0252] The method for producing the polypeptide of the present invention includes a method of intracellular expres- 
sion in a host cell, a method of extracellular secretion from a host cell, or a method of production on a host cell memb rane 
outer envelope. The method can be selected by changing the host ceil employed or the structure of the polypeptid 
produced. 

[0253] When the polypeptide of the present invention is produced in a host cell or on a host cell membrane outer 
"fo envelope, the polypeptide can be positively secreted extracellularly according to, for example, the method of Paulson 
etal, {J. BioL Chen)., 264: 17619 {1909)), the method of Lowe etaL {Proc. Natl. Acad. Sci. USA, 86: 8227 (1989); 
Genes Develop., 4: 1 288 ( 1 990)), and/or the methods described in Japanese Published Unexamined Patent Application 
No. 336963/93, WO 94/23021 , and the like. 

[0254] Specifically, the polypeptide of the present invention can be positively secreted extracellularly by expressing 
^ it in the fomi that a signal peptide has been added to the foreground of a polypeptide containing an active site of the 
polypeptide of the present invention according to the recombinant DNA technique. 

[0255] Furthemnore, the amount produced can be increased using a gene amplification system, such as by use of 
a dihydrofolate reductase gene or the like according to the method described in Japanese Published Unexamined 
Patent Application No. 227075/90. 
so [0256] Moreover, the polypeptide of the present invention can be produced by a transgenic animal individual (trans- 
genic nonhuman animal) or plant individual (transgenic plant). 

[0257] When the transformant is the animal individual or plant individual, the polypeptide of the present invention 
can be produced by breeding or cultivating it so as to produce and accumulat the polypeptide, and recovering the 
polypeptide from the animal individual or plant individual. 
55 , [0258] Examples of the method for producing the polypeptide of the present inv ntion using the animal individual 
include a method for producing the polypeptide of the present invention in an animal developed by inserting a gen 
according to methods known to those off ordinary skill in the art {American Journal of Clinical Nutrition, 63: 639S (1 996), 
American Journal of Clinical Nutrition, 6S. 627S (1996), Bio/Technology, 9. 830 (1991)). 
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[0259] In the animal individual, the polyp ptide can be produced by br ding a transgenic nonhuman animal to which 
the DNA ncoding the polypeptide of the present invention has been inserted to produce and accumulate th polypep- 
tide in the animal, and recovering the polypeptid from the animal. Examples of the production and accumulation place 
In the animal include milk (Japanese Published Unexamined Patent Application No. 309192/88), gg and the like of 
the animal. Any promoter can be used, so long as it can be expressed in the animal. Suitable examples include an a- 
casein promoter, a (P-casein promoter, a p-lactoglobulin promoter, a whey acidic protein promoter, and the like, which 
are specific for mammary glandular cells. 

[0260] Examples of the method for producing the polypeptide of the present invention using the plant Individual 
include a method for producing the polypeptide of the present Invention by cultivating a transgenic plant to which th 
DNA encoding the protein of the present invention by a known method [Tissue Culture, 20 (1994), Tissue Culture, 21 
(1994), Trendsin Biotechr)ology, 15:45 (1997)) to produce and accumulate the polypeptide in the plant, and recovering 
the polypeptide from the plant. 

[0261] The polypeptide according to the present invention can also be obtained by translation in vitro, 
[0262] The polypeptide of the present invention can be produced by a translation system in vitro. There are. for 
example, two in wfro translation methods which may be used, namely, a method using RNA as a template and another 
method using DNA as a template. The template RNA includes the whole RNA. mRNA, an in vitro transcription product, 
and the like. The template DNA includes a plasmid containing a transcriptional promoter and a target gene integral d 
therein and downstream of the initiation site, a PCR/RT-PCR product and the like. To select the most suitable system 
for the in vitro translation, the origin of the gene encoding the protein to be synthesized (prokaiyotic cell/eucaryotic 
cell), the type of the template (DNA/RNA), the purpose of using the synthesized protein and the like should be consid- 
ered. In vitro translation kite having various characteristics are commercially available from many companies (Boe- 
hringer Mannheim, Promega, Stratagene, or the like), and every kit can be used in producing the polypeptide according 
to the present invention. 

[0263] Transcription/translation of a DNA nucleotide sequence cloned into a plasmid containing a T7 promoter can 
be canied out using an in wf/x? transcription/translation system E. coll J7 S30 Extract System for Circular DNA (man- 
ufactured by Promega, catalogue No. L1130). Also, transcription/translation using, as a template, a linear prokaryotic 
DNA of a supercoil non-sensitive promoter, such as /acUV5, tac, APL(con), XPL, or the like, can be carried out using 
an in wYro transcription/translation system E. coll S30 Extract System for Linear Templates (manufactured by Promega, 
catalogue No. LI 030). Examples of the linear prokaryotic DNA used as a template include a DNA fragment, a PCR- 
amplified DNA product, a duplicated oligonucleotide ligation, an in vitro transcriptional RNA, a prokaryotic RNA, and 
the like. 

[0264] In addition to the production of the polypeptide according to the present invention, synthesis of a radioactive 
labeled protein, confinnation of the expression capability of a cloned gene, analysis of the function of transcriptional 
reaction or translation reaction, and the like can be carried out using this system. 

[0265] The polypeptide produced by the transfomiant of the present invention can be isolated and purified using th 
general method for isolating and purifying an enzyme. For example, when the polypeptide of the present invention is 
expressed as a soluble product in the host cells, the cells are collected by centrifugation after cultivation, suspended 
in an aqueous buffer, and disrupted using an ultrasonicator, a French press, a Manton Gaulin homogenizer, a Dynomill, 
or the like to obtain a cell-free extract. From the supernatant obtained by centrifuging the cell-free extract, a purifi d 
product can be obtained by the general method used for isolating and purifying an enzyme, for example, solvent ex- 
traction, salting out using ammonium sulfate or the like, desalting, precipitation using an organic solvent, anion ex- 
change chromatography using a resin, such as diethylaminoethyl (DEAE)-Sepharose. DIAION HPA-75 (manufactured 
by Mitsubishi Chemical) or the like, cation exchange chromatography using a resin, such as S-Sepharose FF (manu- 
factured by Phannacia) or the like, hydrophobic chromatography using a resin, such as butyl sepharose, phenyl sepha- 
ros&or the like, gel filtration using a molecular sieve, affinity chromatography, chromatofocusing, or electrophoresis, 
such as isoelectronic focusing or the like, alone or in combination thereof. 

[0266] When the polypeptide is expressed as an Insoluble product in the host cells, the cells are collected in the 
same manner, disrupted and centrifuged to recover the insoluble product of the polypeptide as the precipitate fraction. 
Next, the insoluble product of the polypeptide is solubliized with a protein denaturing agent. The solubilized solution 
is diluted or diaiyzed to tower the concentration of the protein denaturing agent in the solution. Thus, the normal con- 
figuration of the polypeptide is reconstituted. After the procedure, a purified product of the polypeptide can be obtained 
by a purification/isolation method similar to the above. 

[0267] When the polypeptide of the pres nt invention or its derivative (for xample, a polypeptide fomned by adding 
a sugar chain thereto) is secreted out of cells, the polypeptide or its derivative can be collected in the culture supernatant. 
Namely, the culture supernatant is obtained by treating th culture medium in a treatment similar to the above (for 
example, centrifugation). Then, a purified product can b obtained from the culture medium using a purification/isolation 
method similar to the abov . 

[0268] The polypeptide obtain^d by th abovemethodiswithinthescopeof the polypeptide of the present inv ntion. 
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and examples include a polypeptide needed by a polynucleotide comprising the nucleotide sequence selected from 
SEQ ID N0S:2 to 3431, and a polypeptid comprising an amino acid sequ nee repres nted by any one of SEQ JD 
NOS:3502to6931. 

[0269] Furthermore, a polypeptide comprising an amino acid sequence In which at least one amino acids is deleted, 

5 replaced, Inserted or added in the amino acid sequenc of the polyp ptide and having substantially the same activity 
as that of the polypeptide is included in the scope of the present invention. The temn "substantially the same activity 
as that of the polypeptide" means the same activity represented by the inherent function, enzyme activity or the like 
possessed by the polypeptide which has not been deleted, replaced, inserted or added. The polypeptide can be ob- 
tained using a method for introducing part-specific mutation(s) described in. for example. Molecular Cloning, 2nd ed., 

10 Current Protocols in Molecular Biology, Nuc, Acids. Res., 10. 6487 (1982), Proc. Natt. Acad. ScL USA, 79: 6409 (1982), 
Gene, 34: 315 (1985). Nuc. Acids. Res., 13: 4431 (1985). Proc. Nat!. Acad. Sci. USA, 82. 488 (1985) and the like. For 
example, the polypeptide can be obtained by introducing mutation(s) to DNA encoding a polypeptide having the amino 
acid sequence represented by any one of SEQ ID NOS:3502 to 6931 . The number of the amino acids which are deleted, 
replaced, inserted or added is not particularty limited; however, it is usually 1 to the order of tens, preferably 1 to 20, 

15 more preferably 1 to 10, and most preferably 1 to 5, amino acids. 

[0270] The at least one amino acid deletion, replacement, insertion or addition in the amino acid sequence of th 
polypeptide of the present invention is used herein to refer to that at least one amino acid is deleted, replaced, inserted 
or added to at one or plural positions in the amino acid sequence. The deletion, replacement, insertion or addition may 
be caused in the same amino acid sequence simultaneously. Also, the amino acid residue replaced, inserted or added 

20 can be natural or non-nafural. Examples of the natural amino acid residue include L-alanine, L-asparagine, L-asparatic 
acid, L-glutamine. L-glutamic acid, glycine, L-histidine, L-isoleucine, L-leucine, L-lysine. L-methionine, L-phenylalanin , 
L-proline, L-serine, L-threonine, L-tryptophan, L-tyrosine, L-valine. L-cysteine, and the like. 
[0271] Herein, examples of amino acid residues which are replaced with each other are shown below. The amino 
acid residues in the same group can be replaced with each other. 

25 

Group A: 

[0272] leucine, isoleucine, norieucine, valine, norvaline, alanine, 2-aminobutanoic acid, methionine, O-methylserine, 
t-butylglycine, t-butylalanine, cyclohexylalanine; 

30 

Group B: 

[0273] asparatic acid, glutamic acid, isoasparatic acid, isoglutamic acid, 2-aminoadipic acid, 2-aminosuberic acid; 
35 Group C: 

[0274] asparagine, glutamine; 
Group D: 

40 

[0275] lysine, arginlne, omithine, 2,4-diaminobutanoic acid, 2.3-diaminopropionic acid; 
Group E: 

^ [0276] proline, 3-hydroxyproline, 4-hydroxyproline; 
Group F: 

[0277] serine, threonine, homoserine; 

50 

Group G: 

[0278] phenylalanine, tyrosine. 

[0279] Also, in rd r that the resulting mutant polypeptide has substantially the same activity as that of the polypeptide 
55 which has' not been mutated, it is prefenred that the mutant polypeptide has a homology f 60% or more, preferably 
80% or more, and particulariy preferably 95% or more, with the pol\q>eptide which has not been mutated, when calcu- 
lated, for xampi , using d fault (initial setting) parameters by a homology searching software, such as BLAST, FASTA, 
or the like. 
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[0280] Also, the polypeptide of the pr sent invention can be produced by a chemical synthesis method, such as 
Fmoc (fluorenylmethyloxycarbonyt) method, tBoc (t*butyloxycart>onyl) method, or the lik . It can also be synthesized 
using a p ptide synthesizer manufactured by Advanced ChemTech, Perkin-Elmer, Pharmacia. Protein T chnology 
Instrument, Synthecell-Vega, PerSeptive, Shimadzu Corporation, or the like. 
5 [0281 ] The transformant of the present invention can be used for objects other than the production of the polypeptide 
of the present invention. 

[0282] Specifically, at least one component selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an 
organic acid, and analogues thereof can be produced by culturing the transfonmant containing the polynucleotid or 
recombinant vector of the present invention in a medium to produce and accumulate at least one component selected 
10 from amino acids, nucleic acids, vitamins, saccharides, organic acids, and analogues thereof, and recovering the same 
from the medium. 

[0283] The biosynthesis pathways, decomposition pathways and regulatory mechanisms of physiologically active 
substances such as amino acids, nucleic acids, vitamins, saccharides, organic acids and analogues thereof differ from 
organism to organism. The productivity of such a physiologically active substance can be Improved using these differ- 

15 ences, specifically by introducing a heterogeneous gene relating to the biosynthesis thereof. For example, the content 
of lysine, which is one of the essential amino acids, in a plant seed was improved by introducing a synthase gen 
derived from a bacterium (WO 93/1 91 90). Also, arginine is excessively produced in a culture by Introducing an arginine 
synthase gene derived from Escherichia ca// (Japanese Examined Patent Publication 23750/93). 
[0284] To produce such a physiologically active substance, the transformant according to the present invention can 

20 be cultured by the same method as employed in culturing the transformant for producing the polypeptide of the present 
invention as described at>ove. /VIso, the physiologically active substance can be recovered from the culture medium 
in combination with, for example, the ion exchange resin method, the precipitation method and other known methods. 
[0285] Examples of methods known to one of ordinary skill in tiie art include electroporation, calcium transfection, 
the protoplast method, the method using a phage, and the like, when the host is a bacterium; and microinjection, 

25 calcium phosphate transfection, the positively charged lipid-mediated method and the method using a virus, and the 
like, when the host is a eukaryote {Molecular Cloning, 2nd ed.; Specter et aL, Cells/a laboratory manual, Cold Spring 
Harbour Laboratory Press, 1998)). Examples of the host include prokaryotes, tower eukaryotes (for example, yeasts), 
higher eukaryotes (for example, mammals), and cells isolated therefrom. As the state of a recombinant polynucleotide 
fragment present in the host cells, it can be integrated into the chromosome of the host. Altematively, it can be integrated 

30 Into a factor (for example, a plasmid) having an independent replication unit outside the chromosome. These trans- 
formants are usable In producing the polypeptides of the present invention encoded by tiie ORF of the genome of 
Corynebacterium glutamicum, the polynucleotides of the present invention and fragments thereof. Altematively, tiiey 
can be used in producing arbitrary polypeptides under the regulation by an EMF of the present invention. 

35 11. Preparation of antibody recogriizing the polypeptide of the present invention 

[0286] An €mtibody which recognizes the polypeptide of the present invention, such as a polyclonal antibody, a mon- 
oclonal antibody, or the like, can be produced using, as an antigen, a puritied product of the polypeptide of the pres nt 
invention or a partiaf fragment polypeptide of the polypeptide or a peptide having a partial amino acid sequence of the 
^ polypeptide of the present invention. 

(1 ) Production of polyclonal antibody 

[0287] A polyclonal antibody can be produced using, as an antigen, a purified product of the polypeptide of the 
^ present invention, a partial fragment polypeptide of tiie polypeptide, or a peptide having a partial amino acid sequence 
of the polypeptide of the present invention, arid immunizing an animal with the same. 

[0288] Exarnples of ttie animal to be immunized include rabbits, goats, rats, mice, hamsters, chickens and the like. 
[0289] A dosage of the antigen is preferably 50 to 1 GO ^g per animal. 

[0290] When the peptide is used as the antigen, it is preferably a peptide covalentiy bonded to a carrier protein, such 
so as keyhole limpet haemocyanin, bovine thyroglobulin, or the like. The peptide used as tiie antigen can be synthesiz d 
by a peptide synth sizer. 

[0291] The administration of th antigen is, for example, canied out 3 to 10 times at the intervals of 1 or 2 weeks 
after the first administration. On the 3rd to 7th day after each administration, a blood sample is collected from the 
venous plexus of the eyeground, and it Is confimned that the serum reacts with the antigen by the enzyme immunoassay 
55 {Enzyme-linked Immunosorbent Assay (EUSA), Igaku Shoin (1 976) ; Antibodies - A Laboratory Manual, Cold Spring 
Harbor Laboratory (1988)) or the like. 

[0292] Serum is obtained from the immuniz d non-human mammal with a sufficient antib dy titer against the antigen 
used for the immunization, and the serum is isolated and purified to obtain a polyclonal antibody 
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[0293] Examples of the method for the isolation and purification include c ntrifugation. salting out by 40-50% satu- 
rated ammonium sulfate, caprylic acid precipitation {Antibodies, A Laboratory manual, Cold Spring Harbor Laboratory 
(1988)), or chromatography using a DEAE-Sepharos column, an anion exchange column, a protein A- or G-column. 
a g I filtration column, and the like, alone or in combination thereof, by methods known to those of ordinary skill In the art. 

(2) Production of monoclonal antibody 

(a) Preparation of antibody-producing cell 

[0294] A rat having a serum showing an enough antibody titer against a partial fragment polypeptide of the polypep- 
tide of the present invention used for immunization is used as a supply source of an antibody-producing cell. 
[0295] On thie 3rd to 7th day after the antigen substance is finally administered the rat shewing the antibody titer, the 
spleen is excised. 

[0296] The spleen is cut to pieces in MEM medium (manufactured by Nissui Phamnaceutical), loosened using a pair 
of forceps, followed by centrifugation at 1,200 rpm for 5 minutes, and the resulting supernatant is discarded. 
[0297] The spleen in the precipitated fraction is treated with a Tris-ammonium chloride buffer (pH 7.65) for 1 to 2 
minutes to eliminate erythrocytes and washed three times with MEM medium, and the resulting spleen cells are used 
as antibody-producing cells. 

(b) Preparation of myeloma cells 

[0298] As myeloma cells, an established cell line obtained from mouse or rat is used. Examples of useful cell lines 
include those derived from a mouse, such as P3-X63Ag8-U1 (hereinafter referred to as "PS-UI ") {Cum Topics in Micro- 
bioL Immunol,, 81: 1 (1978); Europ. d Immunol., &. 511 (1976)); SP2/(>Agl4 (SP-2) {Nature, 276: 269 (1978)): 
P3.X63-Ag8653 (653) (J. Immunol., 123: 1548 (1979)); P3-X63-Ag8 (X63) cell line {Nature, 256. 495 (1975)). and the 
like, which are 8-azaguanine-rBsistant mouse (BALB/c) myeloma cell lines. These cell lines are subcultured in 8-aza- 
guanine medium (medium in which, to a medium obtained by adding 1:5 mmol/l glutamine. 5x10-5 mol/l 2-mercap- 
toethanol, 10 ^lg/ml gentamicin and 10% fetal calf semm (PCS) (manufactured by CSL) to RPMI-1640 medium (here- 
inafter referred to as the "nornial medium"), 8-azaguanine is further added at 15 ^ig/ml) and cultured in the normal 
medium 3 or 4 days before cell fusion, and 2x10^ or more of the cells are used for the fusion. 

(c) Production of hybridoma 

[0299] The antibody-producing cells obtained in (a) and the myeloma cells obtained in (b) are washed with MEM 
medium or PBS (disodium hydrogen phosphate: 1 .83 g. sodium dihydrogen phosphate: 0.21 g, sodium chloride: 7.65 
g, distilled water: 1 liter, pH: 7.2) and mixed to give a ratio of antibody-producing cells : myeloma cells = 5 : 1 to 10 : 
1, followed by centrifugation at 1,200 rpm for 5 minutes, and the supematant is discarded. 

[0300] The cells in the resulting precipitated fraction were thoroughly loosened, 0.2 to 1 ml of a mixed solution of 2 
g of polyethylene glycoi-1000 (PEG-1000). 2 ml of MEM medium and 0.7 ml of dimethylsulfoxide (DMSO) per 10^ 
antibody-producing cells is added to the cells under stimng at 37*»C, and then 1 to 2 ml of MEM medium is further 
added thereto several times at 1 to 2 minute inten/als. 

[0301] After the addition, MEM medium is added to give a total amount of 50 ml. The resulting prepared solution is 
centrifuged at 900 rpm for 5 minutes, and then the supematant is discarded. The cells in the resulting precipitated 
fraction were gently loosened and then gently suspended in 100 ml of HAT medium (the nomial medium to which 10^ 
mol/l hypoxanthine, 1.5x10-5 mol/l thymidine and 4x10"^ mol/l aminopterin have been added) by repeated drawing 
up into and discharging from a measuring pipette. 

[0302] The suspension is poured into a 96 well culture plate at 100 ^1/well and cultured at 37''C for 7 to 14 days in 
a 5% CO2 incubator. 

[0303] After culturing, a part of the culture supematant is recovered, and a hybridoma which specifically reacts with 
a partial fragment polypeptide of the polypeptide of the present invention is selected according to the enzyme immu- 
noassay described in Antibodies, A Laboratory manual. Cold Spring Harbor Laboratory, Chapter 1 4 (1998) and the like. 
[0304] A specific example of the enzyme immunoassay is described below. 

[0305] The partial fragment polypeptide of the polypeptid of the present invention used as the antigen in the immu- 
nization is spread on a suitable plate, is allowed to react with a hybridoma culturing supematant or a purified antibody 
obtained in (d) described t>elow as a first antibody, and Is further allowed to react with an anti-rat or anti-mouse immu- 
noglobulin antibody labeled with an nzym , a chemical luminous substance, a radioactive substance or the lik as a 
second antibody for reaction su'rtable for the labeled substance. A hybridoma which specifically reacts with the polyp p- 
tlde of the present invention is s lected as a hybridoma capable of producing a monoclonal antibody of the pres nt 
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invention. 

[0306] Cloning is repeat d using the hybridoma twice by limiting dilution analysis (HT medium (a medium in which 
aminopterin has b en removed from HAT medium) is firstly used, and the nomial medium is secondly used), and a 
hybridoma which is stable and contains a sufficient amount of antibody titer is selected as a hybridoma capable of 
s producing a monoclonal antibody of the present inv ntion. 

(d) Preparation of monoclonal antibody 

[0307] The monoclonal antibody-producing hybridoma cells obtained in (c) are injected intraperitoneally into 8- to 
^0 10-week-old mice or nude mice treated with pristane (intraperitoneal administration of 0.5 ml of 2,6,1 0,1 4-tetrameth- 
ylpentadecane (pristane), followed by 2 weeks of feeding) at 5X10^ to 20x1 0^ cells/animat. The hybridoma causes 
ascites tumor in 1 0 to 21 days. 

[0308] The ascitic fluid is collected from the mice or nude mice, and centrifuged to remove solid contents at 3000 
rpm for 5 minutes. 

15 [0309] A monoclonal antibody can be purified and isolated from the resulting supernatant according to the method 
similar to that used in the polyclonal antibody. 

[031 0] The subclass of the antibody can be determined using a mouse monoclonal antibody typing kit or a rat mon- 
oclonal antibody typing kit. The polypeptide amount can be detennined by the Lowry method or by calculation based 
on the absorbance at 280 nm. 

20 [031 1] The antibody obtained in the at>ove is within the scope of the antibody of the present invention. 

[0312] The antibody can be used for the general assay using an antibody, such as a radioactive material labeled 
immunoassay (RIA), competitive binding assay, an immunotissue chemical staining method (ABC method, CSA meth- 
od, etc.), immunoprecipitation. Western blotting, ELtSA assay, and the like {An introduction to Radioimmunoassay and 
Related Techniques, Elsevier Science (1986); Techniques in Immunocytochemistry, Academic Press, Vol. 1 (1982), 

25 Vol. 2 (1 983) & Vol. 3 (1 985); Practice and Theory of Enzyme Immunoassays, Elsevier Science (1 985); Enzyme-linked 
Immunosorbent Assay (ELISA), Igaku Shoin (1976) ; Antit)odieS'A Laboratory Manual, Cold Spring HariDor laboratory 
(1988); Monoclonal Antibody Experiment Manual, Kodansha Scientific (1987); Second Series Biochemical Experiment 
Course, Vol. 5, Immunobiochemistry Research Method, Tokyo Kagaku Dojin (1986)). 
[0313] The antibody of the present invention can be used as it is or after being labeled with a label. 

30 [0314] Examples of the label include radioisotope, an affinity label (e.g., biotin, avidin, or the like), an enzyme label 
(e.g., horseradish peroxidase, alkaline phosphatase, or the like), a fluorescence label (e.g., FITC, rhodamine, or th 
like), a label using a rhodamine atom, (J. Histochem. Cytochem., 18: 315 (1970); Meth, Enzym., 62. 308 (1979); Im- 
munol., 109: 129 (1972); J. Immunol., Meth., 13: 215 (1979)), and the like. 

[031 5] Expression of the polypeptide of the present invention, fluctuation of the expression, the presence or absence 
35 of structural change of the polypeptide, and the presence or absence in an organism otiier tiian coryneform bacteria 
of a polypeptide conresponding to the polypeptide can be analyzed using the antibody or the labeled antibody by the 
above assay, or a polypeptide array or proteome analysis described below. 

[0316] Furthenmore, the polypeptide recognized by tiie antibody can be purified by immunoaffinity chromatography 
using the antibody of the present invention. 

40 

12. Production and use of polypeptide an^ay 

(1) Production of polypeptide anray 

45 [0317] A polypeptide array can be produced using the polypeptide of the present invention obtained in the above 
item 1 0 or the antibody of the present invention obtained in the above item 11 . 

[031 8] The polypeptide anray of the present invention includes protein chips, and comprises a solid support and th 
polypeptide or antibody of the present invention adhered to the surface of the solid support. 
[0319] Examples of the solid support include plastic such as polycarbonate or the like; an acrylic resin, such as 
50 potyacrylamide or the like; complex cariDohydrates, such as agarose, sepharose, or the like; silica; a silica-based ma- 
terial, carbon, a metal, inorganic glass, latex b ads, and the lik . 

[0320] The polypeptides or antibodies according to the present invention can be adhered to th surface of the solid 
support according to the method described in Biotechniques, 27: 1258-61 (1999); Molecular Medicine Today, 5: 326-7 
(1999); Handbook of Experimental Immunology 4th edition, Blackw II Scientific Publications, Chapter 10 (1986); Meth. 
55 Enzym., 34 (1974); Advances in Experimental Medicine and Biology, 42 (1974); U.S. Patent 4,681 ,870; U.S. Patent 
4.282.287; U.S. Pat nt 4,762,881, rthe like. 

[0321] The analysis described herein can be efficientiy perfomned by adhering the polypeptid or antibody of the 
present invention to th solid support at a high density, tiiough a high fixation density is not always necessary. 
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(2) Use of polypeptide array 

[0322] A polypeplid or a compound capable of binding to and interacting with the polypeptides of th pres nt'in- 
vention adhered to the anray can be identified using the polypeptide array to which the polypeptides of the present 
invention have been adh red thereto as described In the above (1). 

[0323] Specifically, a polypeptide or a compound capable of binding to and interacting with the polypeptides of the 
present invention can be identified by subjecting the polypeptides of the present invention to the following steps (i) to (iv): 

(I) preparing a polypeptide array having the polypeptide of the present invention adhered thereto by the method 
of theat>ove(1); 

(ii) incubating the polypeptide immobilized on the polypeptide an'ay together with at least one of a second polypep- 
tide or compound; 

(ill) detecting any complex f omned between the at least one of a second polypeptide or compound and the polypep- 
tide immobilized on the aray using, for example, a label bound to the at least one of a second polypeptide or 
compound, or a secondary label which specifically binds to the complex or to a component of the complex after 
unbound material has been removed; and 
(iv) analyzing the detection data. 

[0324] Specific examples of the polypeptide array to which the polypeptide of the present invention has been adhered 
include a polypeptide anray containing a solid support to which at least one of a polypeptide containing an amino acid 
sequence selected from SEQ ID NOS:3602 to 7001, a polypeptide containing an amino acid sequence in which at 
least one amino acids is deleted, replaced, inserted or added in the amino acid sequence of the polypeptide and having 
substantially the same activity as that of the polypeptide, a polypeptide containing an amino acid sequence having a 
homology of 60% or more with the amino acid sequences of the polypeptide and having substantially the same activity 
as that of the polypeptides, a partial fragment polypeptide, and a peptide comprising an amino acid sequence of a part 
of a polypeptide. 

[0325] The amount of production of a polypeptide derived from corynef onm bacteria can be analyzed using a polypep- 
tide array to which the antibody of the present invention has been adhered in the above (1). 
[0326] Specifically, the expression amount of a gene derived from a mutant of coryneform bacteria can be analyzed 
by subjecting the gene to the following steps (i) to (iv): 

(i) preparing a polypeptide anray by the method of the above (1); 

(ii) incubating the polypeptide an-ay (the first antibody) together with a polypeptide derived from a mutant of co- 
ryneform bacteria; 

(iii) detecting the polypeptide bound to the polypeptide immobilized on the array using a labeled second antibody 
of the present invention; and 

(Iv) analyzing the detection data. 

[0327] Specific exiamples of the polypeptide anray to which the antibody of the present invention is adhered includ 
a polypeptide an-ay comprising a solid support to which at least one of an antibody which recognizes a polypeptide 
comprising an amino acid sequence selected from SEQ ID NOS:3502 to 7001, a polypeptide comprising an amino 
acid sequence in which at least one amino acids is deleted, replaced, inserted or added in the amino acid sequence 
of the polypeptide and having substantially the same activity as that of the polypeptide, a polypeptide comprising an 
amino acid sequence having a homology of 60% or more with the amino acid sequences of the polypeptide and having 
substantially the same activity as that of the polypeptides, a partial fragment polypeptide, or a peptide comprising an 
amino acid sequence of a part of a polypeptide. 

[0328] A fluctuation in an expression amount of a specific polypeptide pan be monitored using a polypeptide obtained 
in the time course of culture as the polypeptide derived from corynefomi bacteria. The culturing conditions can b 
optimized by analyzing the fluctuation. 

[0329] When a polypeptide derived from a mutant of coryneform bacteria Is used, a mutated polypeptide can be 
detect d. 

13. Identification of useful mutation in mutant by proteome analysis 

[0330] Usually, the proteome is used h rein to refer to a method wherein a polypeptide is separated by twodimen- 
sional electrophoresis and the separated polypeptide Is digested with an enzyme, followed by identification of the 
polypeptide using a mass spectrem ter(MS)ands arching a data bas . 

[0331 ] The two dimensional electrophoresis means an electrophoretic method which is performed by combining two 
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electrophoretic procedures having differ nt principles. For xampi , pofypeptides ar separated depending on molec- 
ular weight in th primary electrophoresis. Next, the gel is rotated by 90** or 180** and the secondary electrophoresis 
is cam'ed out depending on isoelectric point. Thus, various separation patt ms can t>e achieved (JIS K 3600 2474). 
[0332] In searching the data base, the amino acid sequence infomiation of the polypeptides of the present inv ntion 
and the recording medium of the present invention provide for in the abov items 2 and 8 can be used. 
[0333] The proteome analysis of a corynefonn bacterium and its mutant makes it possible to identify a polypeptid 
showing a fluctuation therebetween. 

[0334] The proteome analysis of a wild type strain of corynefomi bacteria and a production strain showing an im- 
proved productivity of a target product makes it possible to efficiently identify a mutation protein which is useful in 
breeding for improving the productivity of a target product or a protein of which expression amount is fluctuated. 
[0335] Specifically, a wild type strain of corynefomi bacteria and a lysine-producing strain thereof are each subjected 
to the proteome analysis. Then, a spot increased in the lysine-producing strain, comparedwith the wild type strain, is 
found and a data base is searched so that a polypeptide showing an increase in yield in accordance with an increase 
in the lysine productivity can be identified. For example, as a result of the proteome analysis on a wild type strain and 
a lysine-producing strain, the productivity of the catalase having the amino acid sequence represented by SEQ ID NO: 
3785 is increased in the lysine-producing mutant. 

[0336] As a result that a protein having a high expression level is identified by proteome analysis using the nucleotide 
sequence infomiation and the amino acid sequence infomiation, of the genome of the coryneform bacteria of th 
present invention, and a recording medium storing the sequences, the nucleotide sequence of the gene encoding this 
protein and the nucleotide sequence in the upstream thereof can be searched at the same time, and thus, a nucleotide 
sequence having a high expression promoter can be efficiently selected. 

[0337] In the proteome analysis, a spot on the two-dimentional electrophoresis gel showing a fluctuation is sometimes 
derived from a modified protein. However, the modified protein can be efficiently identified using the recording medium 
storing the nucleotide sequence information, the amino acid sequence information, of the genome of corynefomi bac- 
teria, and the recording medium storing the sequences, according to the present invention. 
[0338] Moreover, a useful mutation point in a useful mutant can be easily specified by searching a nucleotide se- 
quence (nucleotide sequence of promoters, ORF, or the like) relating to the thus identified protein using a recording 
medium storing the nucleotide sequence information and the amino acid sequence information, of the genome of 
corynefomi bacteria of the present invention, and a recording rnedium storing the sequences and using a primer de- 
signed on the basis of the detected nucleotide sequence. As a result that the useful mutation point is specified, an 
industrially useful mutant having the useful mutation or other useful mutation derived therefrom can be easily bred. 
[0339] The present invention will be explained in detail below based on Examples. However, the present invention 
is not limited thereto. 

Example 1 

Detemiination of the full nucleotide sequence of genome of Corynebacterium glutamicum 

[0340] The full nucleotide sequence of the genome of Corynebacterium glutamicum was determined based on the 
whole genome shotgun method {Science, 269, 496-512 (1995)). In this method, a genome library was prepared and 
the terminal sequences were detemiined at random. Subsequently, these sequences were llgated on a computer to 
cover the full genome. Specifically, the following procedure was earned out. 

(1) Preparation of genome DNA of Corynebacterium glutamicum ATCC 13032 

[0341] Corynebacterium glutamicum ATCC 13032 was cultured in BY medium (7 g/l meat extract, 10 g/l peptone. 3 
gf\ sodium chloride, 5 g/l yeast extract. pH 7.2) containing 1 % of glycine at 30**C ovemight and the cells were collected 
by centrifugation. After washing with STE buffer (10.3% sucrose. 25 mmol/l Trie hydrochloride. 25 mmol/l EDTA. pH 
8.0), the cells were suspended In 10 ml of STE buffer containing 10 mg/ml lysozyme. followed by gently shaking at 
37'*C for 1 hour. Then, 2 ml of 10% SDS was added thereto to lyse the cells, and the resultant mixture was maintained 
at 65*'C for 1 0 minutes and then cooled to room temperature. Then, 1 0 ml of Tris-neutralized phenol was added ther to, 
followed by gently shaking at room temperature for 30 minutes and centrifugation (15,000 x g, 20 minutes. 20**C). The 
aqueous layer was s parated and subjected to extraction with phenot/chloroform and extraction with chlorofonn (twice) 
in the sarne manner. T the aqu ous lay r. 3 mol/l sodium acetat solution (pH 5.2) and isopropanol were added at 
1/10 times volume and twice volume, respectively, followed by gently stim'ng to precipitate the genome DN/^ The 
g nome DNA was dissolved again in 3 ml of TE buffer (1 0 mmol/l Tris hydrochloride, 1 mmol/l EDTA, pH 8.0) containing 
0.02 mg/ml of RNase and maintained at 37**C for 45 minutes. The extractions with phenol, phenol/chloroform and 
chloroform were carri d out successiv ly in the same manner as th above. The genome DNA was subjected to iso- 
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propanol precipitation. The thus formed genome DNA precipitate was washed with 70% thanol three times, followed 
by air-drying, and dissolved in 1.25 ml of TE buffer to give a genome DNA solution (concentration: 0.1 mg/ml). , 

(2) Construction of a shotgun library 

[0342] TE buffer was added to 0.01 mg of the thus prepared genome DNA of Corynebacterium glutamicum ATCC 
13032 to give a total volume of 0.4 ml, and the mixture was treated with a sonicator (Yamato Powersonic IVIodel 160) 
at an output of 20 continuously for 5 seconds to obtain fragments of 1 to 10 kb. The genome fragments were blunt- 
ended using a DNA blunting kit (manufactured by Takara Shuzo) and then fractionated by 6% polyacrylamlde gel 
electrophoresis. Genome fragments of 1 to 2 kb were cut out from the gel, and 0.3 ml MG elution buffer (0.5 mol/l 
ammonium acetate, 1 0 mmol/l magnesium acetate, 1 mmol/l EDTA, 0. 1 % SDS) was added thereto, followed by shaking 
at 37*C ovemfght to elute DNA. The DNA eiuate was treated with phenol/chlorofonm, and then precipitated with ethanol 
to obtain a genome library insert. The total insert and 500 ng of pUCIB Smal/BAP (manufactured by Amersham Phar- 
macia Biotech) were llgated at 16^0 for 40 hours. 

[0343] The ligation product was precipitated with ethanol and dissolved in 0.01 ml of TE buffer. The ligation solution 
(0.001 ml) was introduced into 0.04 ml of £ coli ELECTRO MAX DH10B (manufactured by Life Technologies) by th 
electroporation under conditions according to the manufacture's instructions. The mixture was spread on LB plate 
medium (LB medium (10 g/l bactotrypton, 5 g/l yeast extract, 10 g/l sodium chloride. pH 7.0) containing 1.6% of agar) 
containing 0.1 mg/ml ampicillin, 0.1 mg/ml X-gal and 1 mmol/l isopropyl-p-D-thiogalactopyranoside (IPTG) and cultured 
at 37**C overnight. 

[0344] The transfomnant obtained from colonies formed on the plate medium was stationarily cultured in a 96-w II 
titer plate having 0.05 ml of LB medium containing 0.1 mg/ml ampicillin at 37°C ovemight. Then, 0.05 ml of LB medium 
containing 20% glycerol was added thereto, followed by stining to obtain a glycerol stock. 

(3) Construction of cosmid library 

[0345] About 0.1 mg of the genome DNA of Corynebacterium glutamicum ATCC 13032 was partially digested with 
Sat^AI (manufactured by Takara Shuzo) and then ultracentrifuged (26,000 rpm, 18 hours. 20*^0) under 10 to 40% 
sucrose density gradient obtained using 10% and 40% sucrose buffers (1 mol/l NaCI, 20 mmol/l Tris hydrochloride, 5 
mmol/l EDTA, 1 0% or 40% sucrose, pH 8.0). After the centrifugation, the solution thus separated was fractionated into 
tubes at 1 ml in each tube. After confinming the DNA fragment length of each fraction by agarose gel electrophoresis, 
a fraction containing a large amount of DNA fragment of about 40 kb was precipitated with ethanol. 
[0346] The DNA fragment was ligated to the BamH\ site of superCosI (manufactured by Stratagene) in accordance 
with the manufacture's instmctions. The ligation product was incorporated into Escherichia co// XL-1-BlueMR strain 
(manufactured by Stratagene) using Gigapack III Gold Packaging Extract (manufactured by Stratagene) in accordanc 
with the manufacture's instructions. The Escherichia co// was spread on LB plate medium containing 0.1 mg/ml amp- 
icillin and cultured therein at 37''C ovemight to isolate colonies. The resulting colonies were stationarily cultured at 
37**C ovemight In a 96-well titer plate containing 0.05 ml of the LB medium containing 0.1 mg/ml ampicillin in each 
well. LB medium containing 20% glycerol (0.05 ml) was added thereto, followed by stirring to obtain a glycerol stock. 

(4) Detennination of nucleotide sequence 
(4-1) Preparation of template 

[0347] The full nucleotide sequence of Corynebacterium glutamicum ATCC 1 3032 was determined mainly based on 
the whole genome shotgun method. The template used in the whole genome shotgun method was prepared by the 
PCR method using the library prepared In the above (2). 

[0348] Specifically, the clone derived from the whole genome shotgun library was inoculated using a replicator (man- 
ufactured by GENETIX) Into each well of a 96-weli plate containing the LB medium containing 0.1 mg/ml of ampicillin 
at 0.08 ml per each well and then stationarily cultured at 37*C ovemight. 

[0349] Next, the culturing solution was transported using a copy plate (manufactured by Tokk n) into a 96-well re- 
action plate (manufactured by PE Biosystems) containing a PCR reaction solution (TaKaRa Ex Taq (manufactured by 
Takara Shuzo)) at 0.08 ml per each well. Then, PCR was carried out in accordance with th protocol by Makino et aL 
{DNA Research, 5: 1-9 (1998)) using Gen Amp PCR System 9700 (manufactured by PE Biosystems) to amplify th 
inserted fragment. 

[0350] The excessiv primers and nucleotides wer eliminated using a kit for purifying a PCR production (manufac- 
tured by Amersham Pharmacia Biotech) and the residue was used as the t mplate in th sequencing reaction. 
[0351] Some nucleotid sequences were d termined using a double-stranded DNA plasmid as a template. 
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[0352] The double-stranded DNA plasmid as the templat was obtained by the following method. 
[0353] The clon derived from the whole genome shotgun library was inoculated into a 24- or 96-weII plate containing 
a2x YTmedium(16g/lbactotrypton, 10g/lyeast xtract, 5 g/1 sodium chlorid ,pH 7,0) containing 0.05 mg/mlampicillin 
at 1.5 ml per each well and then cultur d under shaking at 3TC ovemight. 
5 [0354] The double-stranded DNA plasmid was prepared from the culturing solution using an automatic plasmid pre- 
paring machine, KURABO PI-50 (manufactured by Kurabo Industries) or a multiscreen (manufactured by Millipore) in 
accordance with the protocol provided by the manufacturer. 

[0355] To purify the double-stranded DNA plasmid using the multiscreen, Biomek 2000 (manufactured by Beckman 
Coulter) or the like was employed. 
10 [0356] The thus obtained double-stranded DNA plasmid was dissolved in water to give a concentration of about 0. 1 
mg/ml and used as the template in sequencing. 

(4-2) Sequencing reaction 

15 [0357] To 6 fJ of a solution of ABI PRISM BigDye Temninator Cycle Sequencing Ready Reaction Kit (manufactured 
by RE Biosystems). an M13 regular direction primer (M13-21) or an M13 reverse direction primer (M13REV) (DAM 
Research, 5: 1-9 (1998) and the template prepared in the above (4-1) (the PCR product or the plasmid) were add d 
to give 10 ^1 of a sequencing reaction solution. The primers and the templates were used in an amount of 1.6 pmol 
and an amount of 50 to 200 ng, respectively. 

20 [0358] Dye terminator sequencing reaction of 45 cycles was canried out with GeneAmp PCR System 9700 (manu- 
factured by PE Biosystems) using the reaction solution. The cycle parameter was determined in accordance with th 
manufacturer's instruction accompanying ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit. The 
sample was purified using Multiscreen HV plate (manufactured by Millipore) according to the manufacture's instruc- 
tions. The thus purified reaction product was precipitated with ethanol, followed by drying, and then stored in the dark 

25 at -30*^0. 

[0359] The dry reaction product was analyzed by ABI PRISM 377 DNA Sequencer and ABI PRISM 3700 DNA An- 
alyzer (both manufactured by PE Biosystems) each in accordance with the manufacture's instructions. 
[0360] The data of about 50,000 sequences in total (i.e., about 42.000 sequences obtained using 377 DNA Sequenc- 
er and about 8,000 reactions obtained by 3700 DNA Analyser) were transferred to a sender (Alpha Sen/er 4100: man- 
so ufactured by COMPAQ) and stored. The data of these about 50.000 sequences corresponded to 6 times as much as 
the genome size. 

(5) Assembly 

35 [0361] All operations were canied out on the basis of UNIX platform. The analytical data were output in Macintosh 
platform using X Window System. The base call was carried out using phred (The University of Washington). The 
vector sequence data was deleted using SPS Cross.Match (manufactured by Southwest Parallel Software). The as- 
sembly was canied out using SPS phrap (manufactured by Southwest Parallel Software; a high-speed version of phrap 
(The University of Washington)). The contig obtained by the assembly was analyzed using a graphical editor, consed 

^ (The University of Washington). A series of the operations from the base call to the assembly were carried out simul- 
taneously using a script phredPhrap attached to consed. 

(6) Detemnination of nucleotide sequence in gap part 

45 [0362] Each cosmid in the cosmid library constructed in the above (3) was prepared by a method similar to th 
preparation of the double-stranded DNA plasmid described in the above (4-1). The nucleotide sequence at the end of 
the inserted fragment of the cosmid was detennined by using ABI PRISM BigDye Temninator Cycle Sequencing Ready 
Reaction Kit (manufactured by PE Biosystems) according to the manufacture's instructions. 
[0363] About 800 cosmid clones were sequenced at both ends to search a nucleotide sequence in the contig derived 

50 from the shotgun sequencing obtained in the above (5) coincident with the sequence. Thus, the linkage between re- 
spectiv cosmid clones and respectiv contigs w r determined and mutual alignment was cam* d out. Furthermore, 
. the results were compared with the physical map of Corynebacterium gtutamicum ATCC 13032 (MoL Gen. Genet, 
252. 255-265 (1996) to canying out mapping between the cosmids and the contigs. 

[0364] The sequenc in the region which was not covered with the contigs was detennined by the following method. 
55 [0365] Clones containing sequences position d at the ends of contigs were selected. Among these clones, about 
1,000 clones wher in only on end of the inserted fragm nthadb endet nnined were selected and th sequence at 
th opposite nd of th inserted fragment was det nnined. A shotgun library clone or a cosmid clone containing the 
sequences at the respectiv nds of the ins rted fragment in two contigs was identified, the full nucleotide sequence 
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of the inserted fragment of this clone was det nmined. and thus the nucleotide sequence of the gap part was detennined. 
When no shotgun library clone or cosmid clone covering the gap part was available, primers complementary toihe 
end sequences at the two contigs w re prepared and the DNA fragment in th gap part was amplified by PGR. Then, 
sequencing was perfomned by th primer walking method using the amplified DNA fragment as a template or by the 
5 shotgun method in which the sequence of a shotgun clon prepar d from the amplifi dDNAfragm nt was determined. 
Thus, the nucleotide sequence of the domain was determined. 

[0366] In a region showing a low sequence precision, primers were synthesized using AUTOFINISH function and 
NAVIGATING function of consed (The University of Washington) and the sequence was detemilned by the primer 
walking method to improve the sequence precision. The thus detemnined full nucleotide sequence of the genome of 
10 Corynebacterium glutamicum ATCC 1 3032 strain is shown in SEQ ID NO: 1 . 

(7) Identification of ORF and presumption of its function 

[0367] ORFs in the nucleotide sequence represented by SEQ ID N0:1 were identified according to the following 
15 method. Rrst, the ORF regions were determined using software for identifying ORF, i.e., Glimmer, GeneMaric and 
GeneMark.hmm on UNIX platfonr) acconjing to the respective manuaf attached to the software. 
[0368] Based on the data thus obtained, ORFs in the nucleotide sequence represented by SEQ ID N0:1 were iden- 
tified. 

[0369] The putative function of an ORF was determined by searching the homology of the identified amino acid 
20 sequence of the ORF against an amino acid database consisting of protein-encoding domains derived from Swiss- 
Prot, PIR or Genpept database constituted by protein encoding domains derived from GenBank database. Frame 
Search (manufactured by Compugen), or by searching the homology of the identified amino acid sequence of the ORF 
against an amino acid database consisting of protein-encoding domains derived from Swiss-Prot, PIR or Genpept 
database constituted by protein encoding domains derived from GenBank database, BLAST. The nucleotide sequences 
25 of the thus detemnined ORFs are shown in SEQ ID N0S:2 to 3501 . and the amino acid sequences encoded by these 
ORFs are shown in SEQ ID NOS:3502 to 7001 . 

[0370] In some cases of the sequence listings in the present invention, nucleotide sequences, such as TTG, TGT, 
GGT, and the like, other than ATG, are read as an initiating codon encoding Met. 

[0371] Also, the preferred nucleotide sequences are SEQ ID N0S:2 to 355 and 357 to 3501 , and the preferred amino 

30 acid sequences are shown in SEQ ID NOS:3502 to 3855 and 3857 to 7001 

[0372] Table 1 shows the registration numbers in the above-described databases of sequences which were judg d 
as having the highest homology witii the nucleotide sequences of the ORFs as tiie results of the homology search in 
the amino acid sequences using the homology-searching software Frame Search (manufactured by Compugen), 
names of tiie genes of these sequences, the functions of the genes, and the matched length, identities and analogies 

35 compared with publicly known amino acid translation sequences. Moreover, ttie corresponding positions were con- 
firmed via the alignment of the nucleotide sequence of an arbitrary ORF with the nucleotide sequence of SEQ ID NO: 
1 . Also, the positions of nucleotide sequences other than the ORFs (for example, ribosomal RNA genes, transfer RNA 
genes, IS sequences, and the like) on the genome were determined. 

[0373] Fig. 1 shows the positions of typical genes of the Corynebacterium glutamicum ATCC 1 3032 on the genome. 
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dehydrogenase 


myo-inositol catabolism 1 
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rhizopine catabolism protein 
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Streptomyces coelicolor A3(2] 
SC8F1 1.03c 


Streptomyces coelicolor msdA 
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Function 




arabinosyt transferase | 


hypothetical membrane protein 


acetoacetyl CoA reductase | 


oxidoreductase 








proteophosphoglycan | 


hypothetical protein 




hypothetical protein 


rhamnosyl transferase 




hypothetical protein 


O-antigen export system ATP- 
binding protein 


O-antigen export system permease 
protein 


hypothetical protein 


NADPH quinone oxidoreductase 
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length 
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Similarity 
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70.6 


66.1 


56.5 


85.1 








57.4 


83.9 




73.8 


79.1 




55.1 


78.4 


75.6 


63.0 


71.5 
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35.0 
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Homologous gene 




Mycobacterium avium embB 


Mycobacterium tuberculosis 
H37RV Rv3792 
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jO 

CL 

eL 
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o 
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o 

TJ 
3 
« 

0. 


Mycobacterium tuberculosis 
H37RV Rv3790 








Leishmania major ppgl 


Mycobacterium tuberculosis 
H37RvRv3789 




Mycobacterium tuberculosis 
H37RV Rv1864c 


Mycobacterium tuberculosis 
H37RV Rv3782 rfbE 




Agrobacterium tumefaciens 
plasmid pTi-SAKURA tiorflOO 


Yersinia enterocolitica rfbE 


Yersinia enterocolitica rfbO 


Mycobacterium tuberculosis 
H37RV Rv3778c 


Homo sapiens pig3 
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215159 


215162 


216605 


216116 


217141 


217943 


220151 


220154 
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207011 


208989 


209966 


211455 


211768 
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212283 


212656 
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214121 
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216264 
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Function 




probable electron transfer protein 


amino .acid carrier protein | 




molybdopterin biosynthesis protein 
moeB (sulfurylase) 


molybdopterin synthase, targe 
subunit 


molybdenum cofactor biosynthesis 
protein CB 


co-factor synthesis protein 


molybdopterin co-factor synthesis 
protein 


hypothetical membrane protein 


molybdate-binding periplasmic 
protein 


molybdopterin converting factor 
subunit 1 


maltose transport protein 


hypothetical membrane protein 


histidinol-phosphate 
aminotransferase 
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length 
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Similarity 
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51.0 


75.8 




70.1 


75.3 


63.3 
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58.6 


70.5 


68.0 


70.8 


60.8 


76.9 


65.8 










Identity 
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35.0 


46.7 




43.8 


44,7 


33.5 


61.7 


34.5 




34.0 


37.5 


34.3 


36,4 


37.3 








Table 1 (continued) 


Homologous gene 




Mycobacterium tuberculosis 
H37RV Rv3571 


Bacillus subtilts alsT 




Synechococcus sp. PCC 7942 
moeB 


Arthrobacter nicotinovorans 
moaE 


Synechococcus sp. PCC 7942 
moaCB 


Arthrobacter nicotinovorans 
moaC 


Arthrobacter nioptinovorans j 
moeA 


Arthrobacter nicotinovorans 
modB 


Arthrobacter nicotinovorans 
modA 


Mycobacterium tuberculosis 
H37RV moaD2 


Thermococcus lltoralis malK 


Streptomyces coelicolor A3(2) 
ORF3 


Zymomonas mobilis hisC 








35 
40 


db Match 




PIR:A70606 


sp:ALST_BACSU 




m' 

QJ 
O 

o 
o 
a 
> 

Oir- 


prf.2403296D 


□l 
Z 
>- 

CQ 
O 

o 

CL 

w 


o 

S 

t: 
o. 


? 

00 
O 

> 


o> 

CO 

o 

CN 

t: 
cx 


prf:2403296E 


CO 
y~- 
00 

o 
O 

'o. 


prt2518354A 


O 
O 

! 

> 

'6. 

Ul 


O 

1 

I 

CL 

O . 












CN 
CO 

to 


CN 


1476 


Oi 
o 
Oi 


1083 


CO 

in 




468 


1185 


CO 
CN 


s 

to 


CN 
CO 


CN 




1023 


O) 


on 

CN 


o 

CN 


45 


Terminal 
(nt) 


221131 
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230928 


230931 


231848 


232260 


234818 


234910 


235409 
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223685 


224336 
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transcription factor 


alcohol dehydrogenase 


putrescine oxidase | 


magnesium ion transporter | 




Na/dicarboxylate cotransporter | 


oxidoreductase 


hypothetical protein 


nitrogen fixation protein | 






membrane transport protein 
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hypothetical membrane protein 






ABC transporter 
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transposase 
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Table 1 (continued) 


Homologous gene 
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Brucella abortus oxyR 


Bacillus stearothermophilus 
DSM 2334 adh 


Micrococcus rubens puo 


Borrelia burgdorferi mgtE 




Xenopus laevis 


Mycobacterium tuberculosis 
H37RvtyrA 


Mycobacterium tuberculosis 
H37RV Rv3753c 


Bradyrhizobium japonicum 






Mycobacterium tuberculosis 
H37RV Rv0507 mmpL2 


Zymomonas mobllis 


Bacillus subtilis ypdP 






Streptomyces glaucescens strW 


Bacillus subtilis gItX 




Pseudomonas syringae tnpA 
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Function 


aspartate transaminase 




DNA polymerase III holoenzyme tau 
subunit 




hypothetical protein 1 


recombination protein 


cobyric acid synthase 1 


UDP-N-acetylmuramyl tnpeptide 
synthetase 


DNA polymerase III epsilon chain 


hypothetical membrane protein 


aspartate kinase alpha chain 






extracytoplasmic function alternative 
Sigma factor 


vegetative catalase 1 






leucine-responsive regulatory 
protein 


brahched-chain amino acid transport 
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Table 1 (continued) 


Homologous gene 


Brevibacterium lactofermentum 
aspC 




Thermus thermophllus dnaX 




Bacillus subUlisyaaK 


Bacillus subtilis recR 


Heliobacillus mobilis cobQ 


Heliobacillus mobilis murC 


Mycobacterium tuberculosis 
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Corynebacteriunri glutamicum 
(Brevibacterium flavum) ATCC 
13032 orfX 


Corynebacterium glutamicum 
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Mycobacterium smegmatis sigE 


Bacillus subtilis katA 






Klebsiella pneumoniae Irp 
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UDP-N-acetylpyruvoylglucosamine 
reductase 
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Table 1 (continued) 
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hypothetical protein 
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shikimate transport protein 


3-dehydroshikimate dehydratase 


shikimate dehydrogenase 




putrescine transport protein 




iron(lll)-transport system permease 
proteih 




periplasmic-iron-bindina orotein 1 


uroporphyrin-lll Cnnethyltransferase 




15 


Matched 
length 
(aa) 


CN 






CO 

CD 
CN 






tn 
in 


OO 

o 

CO 




CN 
CO 




O) 

o 

CO 


CM 
00 
CN 




CO 
CO 

CO 




CO 

in 




CO 


CO 
00 
TT 




20 


Similarity 
(%) 


100.0 






77,4 


66.2 




74.3 


75.3 1 




57.6 


72.2 


57.9 


98.6 




66.6 




55.2 




59.9 


71.6 






Identity 
(%) 


89.7 






51.0 


40.5 




44.4 


50.7 




27.1 


35.5 


28.2 


98.2 




34.7 




25,1 




25.1 


46.5 




a S !S 

Table 1 (continued) 
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CO 
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o> 
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757630 


758384 
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763237 
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774150 
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746893 


748020 


748026 
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762468 
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774142 
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Function 


hypothetical protein 


dTDP-Rha:a-D.GIcNAc- 
diphosphoryl polyprenot. a-3-L- 
rhamnosyi transferase 


mannose-1-phosphate 
guanylyltransferase 


regulatory protein 


hypothetical protein 


hypothetical protein 


phosphomannomutase 


hypothetical protein 


mannose-6-phosphate isomerase 1 






pheromone-responsive protein 




S-adenosyl-L-homocystelne 
hydrolase 






thymidylate kinase 


IS 
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length 
(aa) 


CM 


o> 
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CM 


CO 

to 
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o> 
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o 
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^ 
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o 

CM 
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Similarity 
(%) 




77.9 


66.9 


81.9 


74.8 


71.3 


66.3 


56.3 


66.2 






57.8 




83.0 






56.0 






Identity 
(%) 


45.5 


56.4 


29.8 


73.4 


48.9 


51.5 


38.0 


31.2 


36.9 






35.6 
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25.8 


25 
30 
35 
40 


Table 1 (continued) 


Homologous gene 


Mycobacterium tuberculosis 
H37RV Rv3267 


Mycobacterium smegmatis 
mc2155wbbL 


Saccharomyces cerevisiae 
YDL055C MPG1 


Mycobacteriunri smegmatis 
whmD 


Mycobacterium tuberculosis 
H37RV Rv3259 


Streptomyces coelicolor A3(2) 
SCE34.11C 


Salmonella montevideo M40 
manB 


Mycobacterium tuberculosis 
H37RV Rv3256c 


Escherichia coli K12 manA 






Enterococcus faecalis plasmid 
pCFlO prgC 




Trichomonas vaginalis WAA38 
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Function 


two-component system response 
regulator 




two-component system sensor 
histidine kinase 


lipoprotein 


hypothetical protein 




30S ribosomal protein or chloroplast 
precursor 


preprotein translocase SecA subunit 




hypothetical protein 


hypothetical protein 


5-enolpyruvylshikimate 3-phosphate 
synthase 


hypothetical protein 
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synthase 
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RNA polymerase sigma factor 
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length . 
(aa) 


CM 
CN 




CO 


in 
in 


CO 
CM 




CO 

0 

CN 


in 

CO 




0 


CO 


Jo 


0 

CO 


CO 
CM 


0 
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65.6 
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99.6 




78.8 


82.9 
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63.9 


100.0 


42.4 


87.2 


Identity 
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29.6 


38.0 




34.5 
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47.1 


64.6 


99.0 


38.3 
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Homologous gene 


Mycobacterium tuberculosis 
H37RV Rv3246cmtrA 




Mycobacterium tuberculosis 
H37RV Rv3245c mtrB 


Mycobacterium tuberculosis 
H37RV Rv3244c IpqB 


Mycobacterium tuberculosis 
H37RV RV3242C 
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Function 


hypothetical protein 


hypothetical protein 






hypothetical protein 


regulatory protein 


ethylene-inducible protein 


hypothetical protein 


hypothetical protein 




alpha-lytic proteinase precursor 




DNA-directed DNA polymerase 


major secreted protein PS1 protein 
precursor 




'.^ 






monophosphatase | 


15 


Matched 
(ength 
(aa) 




O 
Id 
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CO 


o 
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CN 
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tn 

CM 
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Similarity 
{%) 


76.4 


74.9 






73.5 


57.7 


69.0 


53.0 


73.6 








j 51.4 


51.5 










74.9 




Identity 
{%) 


42.6 


43.4 






47.2 


34.3 


67.4 


49.0 


40.8 




26.7 
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27.0 
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25 

a 
c 

c 
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o 
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C3 

£X 
CD 

F 

35 
40 


Homologous gene 


Mycobacterium tuberculosis 
H37RV Rv3195 


Mycobacterium tuberculosis 
H37RV Rv3194 






Mycobacterium tuberculosis 
H37RV Rv3193c 


Deinococcus radiodurans 
DR0840 


Hevea brasiliensis laticifer eri 


Aeropyrum pernix K1 APE0247 


Bacillus subtilts 168 yaaE 




Lysobacter enzymogenes ATCC 
29487 




Neurospora intermedia LaBelle- 
1b mitochondrion plasmid 


Corynebacterium glutamicum 
(Brevibacterium flavum) ATCC 
17965 csp1 










Streptomyces albonlger pur3 
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836892 


839353 


640139 


840210 


640437 


841517 1 


50 


Initial 
. (nt) 


824125 


824190 


825916 


826517 


826616 


830985 


831021 


831922 


831971 


833157 


833572 


CO 
CO 
00 

CO 
00 


835253 


837312 


838925 


639630 


840431 


840745 


842296 




to 2 i 


4373 


4374 


4375 


4376 


4377 


4378 


4379 


4380 


4381 


4382 


4383 


4384 


4385 


CO 
00 
CO 


4387 


4368 


4389 


4390 


4391 


55 


SEQ 
NO. 
(DNA) 


CO 


^ 

00 


in 

CO 


fe 

CO 


CO 


00 
CO 


a> 
1^ 

CO 


o 

CO 
CO 


00 


CN 
CO 
CO 


863 


CD 
CD 


in 

OO 
CO 


<o 

CD 
CO 

) 


00 
00 


CO 
CO 
OO 


669 


o 

OO 


T- 

a> 

00 



EP1 108 790A2 



10 



15 



20 



25 



30 



0) 
3 

C 

o 
u 



0) 



55 



40 



45 



50 



55 



c 
o 

c 

3 



to g-^ 
5- 



2^ 

•n 

to 



o 
o 

E . 

o 

X 



JZ 

u 



(0 

CD 



o 

CL 

g 

o 
E 



(A 
o 
c 

A 

r 



CO 



in 



CO 
CO 



CL 
O 
> 



O 
>» 

E 
o 



CO CO 



CO 

o 
D 

CL 



(0 

c ^ 



(0 ^ 

S c 



292 

(O ^ 



LU ^ Z 



(N 



CM 



CM 
O 
CD 



O 



(0 
41 



55 
u 

0) 

Q. 

0) 

a. 



in 

CO 



00 



CO 
CO 



CO 

< 

o 
ii 
oi 
o 
o 
</> 

0) 

u 
>* 

E 
o 

Q. 



CO 



O 
O 

CO. 



o 



CO 


o 


r— 


o 


CO 




CO 


CO 




CM 




In 








00 


00 


00 



m 

CM 
CO 

CO 



o 

w 
CL 

D> 
C 

C 



c 
o 



> 



CO 



Ol 



o 



III 



(A 

'Si 
_o 

s 

^£ 

p CM 
JO > 

o a 

>»ro 



o 
111 



CO 



in 
cn 

CO 



a 



CO 
C7> 
fO 



CO 



CO 

oo 



Oi 
CO 



c 
'S 
o 

Q. 

C 
O 

'S 
*5 
tj 

oi 
o 



o 

CO 



cn 
d 



o 

3 
U 

3 *= 
O 

ii 
Is; 

o > 
O Ct 
o 

>*co 
2 I 



C7> 
O 

a 



§ 



O) 

o 

CO 
CO 



o> 
c 
*5 
c 
x> 

<i 

CO ® 

m ^ 
« S 

o. c 5 
E 2 

cn Q- JZ 



in 
1^ 



co 



CO 
a. 

E 

tn 



0) 

sz 
u 
1/1 
UJ 



O 
O 

s 

:s 

CO 



CM 

cn 



CO 
CN 
(O 
CO 

oo 



00 

cn 

00 



(O 

o 

CO 



CD 

O) 
CO 



CO 
CO 



cn 

CO 



CO 



CO 



o. 



eg 

N 



CO 

o 
> 



cn 

oi 
in 



CD 

CM 



in 

(O 

i 

o 

0} 

o 
jr 
u 
o 

5 3 
> > 



X 

o 

CD 



CO 
> 



tn 

CM 
CO 



s 



CO 

CM 
CO 
(7) 
TT 
00 



CM 
O 



CM 
O 
CD 



C 

S 
o. 

c 

CO 

e 

U 

f 

£X 

x: 



o> 



CM 



CO 
CO 



0) 

n 

.2 o 

m CO 

I Si 



CO 

CD 
O 



CO 

tn 



CM 

o 
in 

00 



cn 
en 
a> 
o 
in 

GO 



o 



o 
cn 



c 

o 
Ol 
O) 

c 

c 

IS 
c 

CO 

Is 

•ii 



in 

CM 

CO 



tn 

CO 



CM 



s 

m 



3 
O) 

c 
(n 

o 

•c 

*> 



< 

CD 

I' 



CO 
CO 
CM 

in 

CO 



in 

CO 



00 



o 



in 
o 
o> 



o 

C 
CO 
w 

O 
CD 
< 

E 2 
p (0 

^ s 

o o. 



CO 



00 
CO 



CO 
CO 



00 
CO 



to 
to 



u 

CO 

CD 



CO 

CD 
cn 

CO 

CQ 
l: 



o> 
cn 
o> 



CO 
CO 

tn 



0) 

t: 
o 

Q. 

to 
c 

(0 
^ 

O 
m 
< 

£ 2 

O (0 

o c 



o 



in 

CO 



O 
c5 
>t 

oo 

CD 



O 
CO 

CD 



C7> 
CD 
O 

l: 



CM 

cn 



xr 
in 

CO 



4> 

t: 
o 
o. 

40 
C 
CO 



§1 

I 2 
1 = 



CM 



CM 
OO 



CO 



oo 

CO 



JQ 
3 
CO 
CO 

_3 

u 
to 
CD 



CO 
CD 

r- 
cn 

CO 

O 



CO 

in 



CD 

in 
in 

CO 



CO 
CM 

m 

CO 



CO 

o 
cn 



CO 
00 

CO 

in 

CO 



o 



o 
<n 



CM 

in 

CO 



CO 

o 



GO 
O 

cn 



EP1 108 790A2 



10 



15 



20 



25 



30 



c 

o 
u 



35 



40 



45 



50 



55 



c 
o 



3 



CO 



1^ 



C 
0) 
O) 
CO 

O 
Ol 

o_ 
o 

E 
o 
X 



■o 



II 



(0 



(0 , 

It: 



w 2 2. 



26S 



o. 

"(0 

o 

'Si 

«^ 

o 
a 

>* 



CM 



O) 

E 
E 

!5 
E 



£ CM 

(0 Q 



CO 

u. 

Q. 



CO 

o 
o 

CO 

oo 



CM 
CM 



CO 



CJ) 

o 



o 



0) 

c 
E 

(0 

in w 
CO 2C 

0) <n CQ 
c p 

COW 
3 C *2 
g, E 2 



CM 



cri 

CO 



</) 

O 
0) 

o 



3 



o 

CM 
(O 
CO 

(0 

l; 



cn 
o 

CM 



CM 

m 

CM 
CD 
GO 



in 



CO 



m 
u 



CL 
0) 

< 

o 



CO 
(O 



CM 
CO 



o 

CO 



ca 

•5 CM 

o CK 

iS ^ 

85 

t5 
s> 

5y 

O 00 
U 00 
CO CM 

CO CO 



CO 
CD 



<0 
(O 

o 
m 

CO 
00 



0) 

Z 

o. 

75 
o 

a> 
o 

>v 



CO 



in 

CD 



CO 



3 
O 

3 

p CM 

^ > 
o a: 

2 X 



CO 

o 

LL 



c 
o 

w 
Q. 

75 
o 



o 



in 



CM 
CO 



o 



P CO 

Is 

x» > 

o 0^ 

O 1^ 

S X 



CO 

o 
O 



CD 
C7> 



m 

CO 
CO 



CO 



CO 

CO 
00 



CO 



C3) 



in 



CO 

m 

CO 



in 
5) 



CO 



c 

a 

75 
u 



o 

CL 
>» 



O) 

in 



CO 

m 



CO 
CM 



« CM 

S2 



in 
O 



o. 



m 

CM 

m 



to 

CO 
CO 



CO 

o 
o> 
o> 

CO 
CO 



cn I O) 



C7) 

5 



(A 

CD 

£ 

CA 



o 
o 
>» 
u 
a> 
c 

E 



CO 
CM 



CO 



(A 
C 

2 

3 

■o 

{0 

(A 
3 
O 
U 

O CM 

U T- 

O ^ 

c o 
*d} a: 
o a 



CO 

o 
o 

UJ 

< 



1^ 



Oi 
O) 
CD 
CO 



CD 
O 

GO 



o 

CM 



o 

CM 
C7) 



(A 
CO 



o> 



CO 



o 

75 
o 
u 

CA 
O 

§.C« 

E9 

Q. VJ 
Q) CO 

O 
CO CO 



m 
O 

CO 

O 
CO 



CO 



CM 
CO 



00 
CO 

CO 



CM 

CM I 

2 1 



CO 
CM 



CO 
CM 



(A O 
O 'tr 

-c g 

CO sr 

5 * 
¥ E 



C7> 
CO 



m 



CM 



3 



lA 
U 

E 
o 

"5. 

CO 



z 

CO 

h- 



CO 
CM 
CO 



o 
o 

CO 



CM 
CO 



m 

CM 



in 

CM 

cn 



i ^ 

CM 



EP 1 108 790 A2 



5 
10 


Function 


hypothetical protein 


phosphoserine transaminase 1 


acetyl-coenzyme A carboxylase 
carboxy transferase subunit beta 


hypothetical protein 


sodium/proline symporter 1 


• 


hypothetical protein 


fatty-add synthase 






homoserine O-acetyKransferase 






glutaredoxin 


dihydrofolate reductase 


thymidylate synthase 


ammonium transporter 1 


ATP dependent DNA helicase 


formamidopyrimidine-ONA 
glycosidase 


15 


Matched 
length 
(aa) 


<o 

CO 


CO 


CO 
CO 
CN 


CO 

o 


cn 

CO 




CO 
CM 


3026 






if) 
CO 

CO 






CM 
CO 






CM 

o 

CN 


1715 


oo 
cn 

CN 


20 


Similarity 
1%) 


55,1 


52.9 1 


69.5 


80.6 


58,1 




77.4 


83.4 






59.7 






72.6 


1 62.0 


88.9 


56.4 


68.1 


51.0 




Identity 
(%) 


. 32.6 


21.9 


36,0 


51.5 


26.4 




o 


63.1 






29.0 






43.6 


38.0 


64.8 


32.2 


47.4 


29.2 


25 

■o 

Q) 
3 
C 

c 
o 

Q> 

|2 

35 


Homologous gene 


Mycobacterium tuberculosis 
H37RV Rv0883c 


Bacillus circulans ATCC 21783 


Escherichia coli K12 accD 


Streptomyces coelicolor A3(2) 
SCI8.08C 


Pseudomonas.fluorescens 




Mycobacterium tuberculosis 
H37RV Rv2525c 


Corynebacterium 
ammoniagenes fas 






Leptospira meyeri metX 






Deinococcus radiodurans 
DR2085 


Mycobacterium avium folA 


Escherichia coti K12thyA 


Escherichia coli K12cysQ 


Streptomyces coelicolor A3(2) 
SC7C7.16C 


Synechococcus elongatus 
naegeli mutM 


40 


db Match 


sp:YZ11_MYCTU 


O) 
CO 

€0 

.H 
*a. 


sp:ACCD_ECOLI 


gp:SCI8_8 


plrJC2382 




plrA70657 


pirS55505 






prf;231 73358 






gp:AE002044_8 


prf:2408256A 


sp:TYSY_ECOU 


'sp:CYSQ_ECOLI 


CO 

i^' 

O 
h- 
O 
CO 

CL 

a> 


Ui 

z: 
> 
to 

o' 

GL 
U- 

' a. 
• <n 






CO 
CO 
O) 


1128 


1473 


cn 
ro 

CO 


1653 


CO 
00 


o 

GO 


8907 


cn 
oo 


(O 

to 


1047 


CO 
CM 


CO 
CN 


CO 
CM 


CD 

in 


00 
0> 


CD 

in 


4560 


CO 
CD 


45 


Terminal 
(nt) 


874951 


875985 


879642 


861985 


883647 


884541 


864549 


894578 


895191 


895593 


895596 


896719 


897689 


897727 


897979 


898434 


899253 


904602 


905382 


50 


InKial 
(nt) 


875883 


877112 


881114 


881647 


681995 


883726 


865388 


885672 


894703 


895408 


896642 


897144 


897423 1 


897963 


898434 


899231 


800006 


900043 


904615 




CO z ^ 


4427 


4428 


Oi 
CN 


4430 


4431 


4432 


4433 


4434 


4435 


4436 


4437 


4438 


4439 


4440 




CM 


4443 


4444 


4445 


55 


SEQ 
NO. 
(ONA) 


1^ 

S! 
o> 


CD 
CN 

o> 


Oi 
CN 


o 

CO 

o> 


si 


CN 
CO 

cn 


CO 
CO 
O) 


CO 

cn 


IO 

CO 

cn 


CD 
CO 

o> 


CO 

o> 


00 
CO 

cn 


cn 

CO 

cn 


o 
cn 


cn 


CN 

■ 


CO 

cn 




m 
cn 



EP1 108 790 A2 



5 
10 




Function 


hypothetical protein 


alkaline phosphatase 


integral membrane transporter 




glucose-6-phosphate isomease 


hypothetical protein 




hypothetical protein 


ATP-dependent helicase 


ABC transporter 


ABC transporter 




peptidase 


hypothetical protein 




5'-pKosphoribosylglyclnamide 
formyltransferase 


6 
o 

to 

la 

^§ 

XJ 10 

II 

m O 
O -D 

m ^ 


citrate lyase (subunit) 


15 




Matched 
length 
(aa) 


00 
CN 

<r— 


CD 
O) 


ro 

o 






to 
cn 




76 


CO 

je 


in 

00 
00 


CN 




CD 
CO 
CM 






cn 

00 


in 

CN 

in 


(N 






Similarity 
(%) 




Oi 


o 




o 


CO 




cn 




CO 






CO 


00 




CN 


CO 


100.0 


20 




(D 
CO 




CD 




r-. 


c4 
tn 




tri 

CD 


CO 


00 

■V 






CO 


c=» 
(O 




CD 
00 


CO 






Identity 
(%) 


55.5 


38.6 


33.6 




52.4 


24.6. 




59.0 


46.1 


21.8 


43.8 




43.6 


31.1 




64.6 

i 


74.5 


■ 100.0 


25 
30 
35 


Table 1 (continued) 


Homologous gene 


Mycobacterium tuberculosis 
H37RV Rv0870c 


Lactococcus lactis MG1363 apl 


Streptomyces coelicolor A3(2) 
SCI28.06C 




Escherichia coli JM101 pgi 


Mycobacterium tuberculosis 
H37RV Rv0336 




Mycobacterium tuberculosis 
H37RV Rv0948c 


Bacillus stearothermophilus 
NCA1503pcrA 


Streptomyces coelicolor A3(2) 
SCE25.30 


Bacillus subtilis 168 yvrO 




Mycobacterium tuberculosis 
H37RV Rv0950c 


Mycobacterium tuberculosis 
H37RV RV0955 




Corynebacterium 
ammoniagenes purN 


Corynebacterium 
ammoniagenes purH 


Corynebacterium glutamicum 
ATCC 13032 citE 


40 




db Match 


CO 

i 

u, 
'a. 


sp:APL_LACLA 


CD 

l; 
5. 




pir.NUEC 


pirG70506 




sp:YT26_MYCTU 


co 

CO 

o 
a 

Q. 

tn 


o 

"i 

in 

CN 
UJ 
O 

w 

a. 
cn 


prf 242041 OP 




CO 

o 

u: 
"a. 


sp:YT19_MYCTU 




cn 
m 

s 

o 
o> 


tn 

s 

o 

CL 

o> 


? 

CO 
CO 

Ij 
(D 
O 

CL 

cn 








CO 

o 


o 
o 

CO 


1173 




1620 


1176 


00 
CO 


cn 
o 

CO 


2289 


2223 


CD 
CD 
CD 


o 
tn 


T— 

r— 

1^ 


1425 


00 
CN 

CM 


1^ 

CN 
CO 


1560 


cn 

CO 


45 




Terminal 
(nt) 


905796 


905792 


' 906559 


909328 


907759 


909521 


911223 


910855 


913514 


913477 


915699 


916366 


916970 


919352 


917827 


919956 


921526 


922412 


50 




Initial 
(nt) 


905369 


906391 


907731 


908612 


1 909378 


910696. 


1 910843 


1 

; 911163 


911226 


1 915699 


916364 


916874 


917680 


917928 


918054 


919330 


919967 


921594 






SEQ 

NO. 
(a.a.) 


CO 

■a- 


4447 


CO 


4449 


4450 


4451 


4452 


4453 


^ 

in 


4455 


4456 


4457 


4458 


4459 


4460 


CO 

XT 


4462 


ro ! 
CD ; 
^ ! 

^ : 


55 




SEQ 
NO. 
(DNA) 


CD 
<7) 




CO 

cn 


Oi 
Oi 


o 
tn 
cn 


t— 
in 

O) 


CN 

m 
<n 


CO 

in 
cn 


tn 
cn 


in 
tn 
m 


<9 
in 

cn 


in 1 
cn 


CO 

tn 
cn 


656 


o 

(O 
O) 


<0 

o> 


CN 
CO 
CD 


CO 

to 
cn 



EP1 108 790 A2 



5 
10 




Function 


repressor of the high-affinity (methyl) 
ammonium uptake system 


hypothetical protein 




30S ribosomal protein SI 8 


30S ribosomal protein S14 


50S ribosomal protein L33 


50S ribosomal protein L28 1 


transporter (sulfate transporter) 


Zn/Co transport repressor 


50S ribosomal protein L31 
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transposase 


transposase subunit 
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Table 1 (continued) 
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Function 


hypothetical protein 












alkaline phosphatase D precursor 




hypothetical protein 
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DNA primase 


ribonuclease Sa 
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Table 1 (continued) 


Homologous gene 


Streptomyces coelicolor A3(2) 
SC4A7.08 












Bacillus subtilis 168 phoD 
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Function 


hypothetical protein 


hypothetical protein 




glycyl-tRNA synthetase | 


bacterial regulatory protein, arsR 
family 


ferric uptake regulation protein 


hypothetical protein (conserved in 
C.glutamicum?) 


hypothetical membrane protein 


undecaprenyl diphosphate synthase 


hypothetical protein 


Era-like GTP-binding protein 


hypothetical membrane protein 
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Function 


heat shock protein dnaJ 1 


heat-inducibte transcriptional 
repressor (groEL repressor) 


OKygen-independent 
coproporphyrinogen 111 oxidase 


agglutinin attachment subunit 
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Neisserial polypeptides predicted to 
be useful antigens for vaccines and 
diagnostics 


polypeptides predicted to be useful 
antigens for vaccines and 
diagnostics 






peptidyl-dipeptidase 


carboxylesterase | 


glycosyl hydrolase or trehalose 
synthase 




hypothetical protein 
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Homologous gene 


Streptomyces albus dnaJ2 


Streptomyces albus hrcA 


Bacillus stearothermophilus 
hemN 


Saccharomyces cerevisiae 
YNR044WAGA1 






Streptomyces coelicolor A3{2) 
SC6G10.04 


Escherichia coll K12malQ 


Lactobacillus brevis plasmid 
horA 


Neisseria gonon-hoeae 
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Salmonella typhimurium dcp 
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Mycobacterium tuberculosis 
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Function 


Isopentenyl-diphosphale Delta- 
isomerase 












beta lyase (degradation of 
aminoethylcvsteine) 


branched-chain amino acid transport 
system carrier protein (isoleucine 
uptake) 


atkanal monooxvaenase aloha chain 




malonate transporter 


glycolate oxidase subunit 


transcriptional reaulator 


1 


hypothetical orolein 




heme-binding protein A precursor 
(hemin-bindino liooorotein^ 


oligopeptide ABC transporter 
(permease) 


dipeptide transport system 
permease protein 


Oligopeptide transport ATP-binding 
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Corynebacterium glutamicum 
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Escherichia coli K12glcD 
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C7) 

in 
tn 

CM 



S to 

CM CN 



CO 
CD 

tn 

CM 



CD 

m 

CN 



m 

CD 

in 

CN 



CO 
CO 

m 

CM 



CO 

in 



EP1 108 790 A2 



5 
10 




Function 


hypothetical protein 


SOS nbosomal protein S20 1 


thrreonine efflux protein 


ankyrin-IIke protein 


hypothetical protein 


late competence operon required for 
ONA binding and uptake 


late competence operon required for 
ONA binding and uptake 




hypothetical protein 


phosphoglycerate mutase 


hypothetical protein 


hypothetical protein 




1 — ' — — 

g3mma*glutamyl phosphate 
reduct9se or glutamate-5- 
semldldehyde dehydrogenase 


D-isomer specific 2-hydroxyacid 
dehydrogeriase 




GTP-binding protein 


15 




Matched < 
length 
(aa) 


tn 

CO 


to 

00 


o 


O) 
CM 

T- 




1^ 

CM 

in 


m 




CO 
CM 


in 

CO 
CM 




o> 




CM 
CO 


o 

CO 




00 


20 




Similarity 
(%) 


69.7 


72.9 


67.1 


80.6 




49.7 


63.6 




66.3 


66.4 


86.3 


85.3 




99.8 


100.0 




78.2 






Identity 
(%) 


41.6 


48.2 


30.0 


61.2 


46.0 


21.4 


30.8 




34.8 


46.8 


55.6 


68.0 




99.1 


99.3 




56.9 


25 
30 
35 
40 


Table 1 (continued) 


Homologous gene 


Mycobacterium tuberculosis 
H37RV Rv2405 


Escherichia coli K12 rpsT 


Escherichia coll K1 2 rhtC j 


Streptomyces coelicolor A3(2) 
SC6D7.25. 


Mycobacterium tuberculosis 
H37RV Rv2413c 


Bacillus subtilis 168 comEC 


Bacillus subtilis 168 comEA 




Streptomyces coelicolor A3(2) 
SCC123.07C, 


Mycobacterium tuberculosis 
H37RV Rv2419c 


Mycobacterium tuberculosis 
H37RV Rv2420c 


Streptomyces coelicolor A3{2) 
SCC123.17C. 




Corynebacterium glutamicum 
ATCC 17965 proA 


Corynebacterium glutamicum 
ATCC 17965 unkdh 




Streptomyces coelicolor A3{2) 
obg 




db Match 


00 
"CO 

o 

X 

l: 
Q. 


sp:RS20_ECOLI 


sp:RHTC_ECOLI 


tn 

Q 
CO 

O 
CO 
d. 


. 00 
CD 
O 
1^ 
X 
l: 
Q. 


D 

CO 

o 
< 

CD 

r>' 

UJ 

o 

d. 

(A 


Z) ' 
CO 

o 
< 

CQ 
^1 

UJ 

o 




«' 

CM 

o 
o 

CO 

Q. 
CJ) 


in 
oo 

CD 

o 
u. 

Cl 


pir:G70685 


gp:SCC123_17 




sp:PROA_CORGL 


sp:YPRA_CORGL 




gp:D87915^1 






ORF 
(bp) 


CJ) 

o 

CO 


co 

CM 


CD 
CD 
CD 




in 
Oi 


1539 


CM 
00 

in 


CM 
CM 
00 


CM 
CM 
CO 


00 

o 


^ 


CD 


1023 


1296 


CM 
O) 




1503 


45 




Terminal 
(nt) 


2485269 


2485733 


2485801 


2486477 


2486910 


2487912 


2489573 


2491732 


2490290 


2491151 


2491873 


2492501 


2493215 


2494339 


2495696 


2497513 


2498009 


50 




Initial 
(nt) 


2484661 


2485473 


2466469 


2486681 


CO 
00 

CO 

CM 


2489450 


2490154 


2490911 


2491111 


2491858 


2492343 


2493178 


2494237 


2495634 


2496607 


2496803 


2499511 






CO 2 ™. 


6068 


6069 


6070 


6071 


6072 


6073 


6074 


6075 


6076 


6077 


6078 


6079 


6080 


6081 


6082 


6083 


6084 


55 


1 


SEQ 
NO. 
(ONA) 


2568 


2569 


2570 


2571 


2572 


2573 


2574 


2575 


2576 


2577 


2578 


2579 


2580 


2581 


2582 


2583 


2584 
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790 A2 



5 
10 


Function 


xanthine pemiease | 


2.5-diketo-D-gIuconic acid reductase 






50S ribosomal protein L27 


508 ribosomal protein L21 


ribonuclease E 








hypothetical protein 


transposase (insertion sequence 
IS31831) 


hypothetical protein 


hypothetical protein 


nucleoside diphosphate kinase | 




hypothetical protein 


hypothetical protein 


hypothetical protein 


15 


Matched 
length 
(aa) 


CM 
CM 

■V 


CO 
CM 






00 


o 


CO 
OO 
GO 








m 
cn 






CO 


CO 




CN 

o> 


CM 


00 




Similarity 
(%) 








































20 


77.3 


81.9 






92.6 


82.2 


56.6 








82.6 


100.0 


76.9 


67.8 


89.6 




67.4 


64.3 


68.6 




Identity 
(%) 


39.1 


CM 






80.3 


56.4 


30.1 








61.0 


99.1 


51.3 


37.8 


70.9 




34.8 


36.6 


33.9 


25 

3 
C 
•3 
C 

30 ^ 

0) 
CD 

35 
40 


Momologous gene 


Bacillus subtilis 168 pbuX 


Corynebacterium sp. ATCC 
31090 






Streptomyces griseus IF013189 
rpmA 


Streptomyces jgriseus IF013189 
obg 


Escherichia coli K12 rne 








Streptomyces coelicolor A3(2) 
SCF76.08C 


Corynebacterium glutamicum 
ATCC 31831 


Streptomyces coelicolor A3(2] 
SCF76.08C 


Streptomyces coelicolor A3(2) 
SCF76.09 


Mycobacterium smegmatis ndk 




Deinococcus radiodurans R1 
DR1844 


Mycobacterium tuberculosis 
H37RvRv1883c 


Mycobacterium tuberculosis 
H37RV Rv2446c 


db Match 

i 


CO 

o 
< 

CD 

x' 

D 
CD 
CL 

Ol 

tn 


CO 

CO 
00 

o 

l: 






cc 
iD 

cr 

t< 

CM 
—1 

d. 


% 

CD 
CM 
^ 
O 
CO 
CM 

t: 
o. 


sp:RNE_ECOLI 








00 

«' 

U. 

O 
CO 
b. 

O) 


pir.S43613 


*i 

CO 

u. 
O 
CO 

o. 
o» 


CO 
LL 

O 
CO 
Q. 
O) 


m 
o> 

u. 
< 




o 

CN 

o 
o 

!S 

O) 


in 

o 

X 

u* . 
o. 


pirE70863 






1887 


CO 
OO 


CM 

CO 


CO 
C7> 
CO 


CD 
CM 


CO 

o 

CO 


2268 


tn 


CO 

in 




Oi 
o 

CO 


1308 


00 

CO 


o 
in 
xj- 


CO 

o 


o 

CO 
CO 


CN 

■«r 

CO 


m 

CD 


CO 
CM 


45 


Terminal 
(nt) 


2501669 


2501735 


2503355 


2504265 


2503984 


2504300 


2504831 


2507663 


2507710 


2508840 


2509530 


2509523 


2511423 


2511876 


2511949 


2512409 


2513144 


2513154 


2513692 


50 


Initial 
(nt) 


2499783 


2502577 


2502735 


2503870 


2504247 


2504602 


2507098 


2507115 


2507138 


2508094 


2508922 


2510830 


2511046 


2511427 


2512356 


2512768 1 


2512803 


2513618 


2514114 




So s 

CO 2 S 


6085 


6066 


6087 


8809 


6089 


0609 


1609! 


6092 


6093 


6094 


6095 


9609 


6097 


6098 


6609 


6100 


6101 


6102 1 


6103 


55 


SEQ 
NO. 


2585 


2586 


2587 


2588 


2589 


2590 


1 2591 


2592 


2593 


|2594 


2595 


2598 


2597 


2598 


2599 


2600 


2601 


2602 


2603 
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3 
C 



5 



Function 


folyl-polyglutamate synthetase 








valyl-tRNA synthetase 


oligopeptide ABC transport system 
substrate-binding protein 


heat shock protein dnaK 


lysine decarboxylase 


malate dehydrogenase 


transcriptional regulator 


hypothetical protein 


vanillate demethylase (oxygenase] 


pentachlorophenol 4- 
monooxygenase reductase 


transport protein 


malonate transporter 


class-Ill heat-shock protein or ATP- 
dependent protease 


hypothetical protein 


succinyl CoA:3-oxoadipate CoA 
transferase beta subunit 


succinyl CoA:3-oxoadipate CoA 
transferase alpha subunit 


Matched 
length 
(a.a) 










lO 
09 


CM 

in 


CO 

o 

tn 


o 

r- 


cn 

r- 
c^ 


o 

CM 


CO 

o 

CM 


m 

CO 


CO 
CO 

CO 




S 

CM 


o 

CO 


CD 
(O 
CO 


o 

CM 


m 

CM 


Similarity 
(%) 


79,6 








72.1 


58.5 


54.9 j 


71.2 


76.5 


56.5 


51,4 


68.6 


59.2 


76.8 


58.4 


85.8 


73.0 


85.7 


84.5 


Identity 
(%) 


55.4 








45,5 


24.2 


26.2 1 


42.9 


56.4 


24.6 


26.0 


39.5 


32.8 


40.8 


28.0 


59.8 


45.6 


63.3 


60.2 


Homologous gene 


Streptomyces coelicolor A3(2) 
folC 








Bacillus subtilis 168 balS 


Bacillus subtilis 168 oppA 


Bacillus subtilis 166 dnaK 


Eikenella corrodens ATCC 
23824 


Thermus aquaticus ATCC 33923 
mdh 


cT 

< 

o 
o 

.£! 
"oi 
o 
o 
to 

O CO 

O T- 

Q.< 
O ^ 

is O 
CO CO 


Vibrio cholerae aphA 


Acinetobacter sp. vanA 


Sphingomonas flava ATCC 
39723 pcpD 


Acinetobacter sp. vanK | 


Klebsiella pneumoniae mdcF 


Bacillus subtilis cIpX 


Streptomyces coelicolor A3(2) 
SCF55.28C 


Streptomyces sp. 2065 pcaJ 


Streptomyces sp. 2065 peal 


db Match 


03 
CM 

in 

& 

CM 

a. 








3 
CO 

o 
< 

00 

^' 

CO 
Q. 

</) 


pir:A38447 


sp:DNAK_BACSU 


(0 

C7) 
GO 

3 
O 
UJ 

o. 

O) 


sp:MbH_THEFL 


CO 

< 

o 

CO 

CL 
O) 


CM* 
(O 

£ 
< 

b. 
cn 


a. 

CO 

in 

CM 

t: 
cx 


gp:FSU12290_2 


prf.2513416G 


CO 

o 
m 
Oi 
3 
O. 

Q. 
CO 


< 

CO 

o 

CO 
CM 

iz 
a. 


CO 

in 
m 

IL 

O 
(0 

CL 

Oi 


CM 

CD 
CO 
CO 
Oi 
O 

< 
P> 


CD 
CO 

cn 
o 

< 

CL 

o> 




1374 


CM 
S 


r- 


CO 


2700 


1575 


1452 


in 

00 

m 


S 

Oi 




CD 

m 


1128 


m 

at 


1425 


O 

CO 

a> 


1278 


1086 


CO 

CO 
CD 


O 

in 


Terminal 
(nt) 


2514114 


2516273 1 


2516956 


2517751 


2515637 1 


2518398 


2521660 


2521667 


2522265 


2524337 


2524340 


2526226 


2527207 


2528559 


2528551 


2529484 


2531976 


2531969 


2532604 


Initial 
(nl) 


2515467 


2515662 


2516243 


2517089 


2518336 


2519972' 


2520209 


2522251 


2523248 


2523561 


2524915 


2525099 


2526233 


2527135 


2529480 


2530761 


2530891 


2532601 


2533353 


Sgs! i 

CO 2 S| 5 


6105 


6106 


6107 


6108 


6109 


6110 


6111 


6112 


6113 


6114 


6115 


6116 


6117 


6118 


6119 


6120 


6121 


6122 


SEQ 
NO. 
(DNA) 

2604 


2605 


2606 


2607 


2608 


2609 


2610 


2611 


2612 


2613 


2614 


2615 


2616 


2617 


2618 


2619 


2620 


CM 
03 
CM 


2622 
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c 



0) 



Function 


protocatechuate catabofic protein 


beta-ketothiolase 




3-oxoadipate enoMactone hydrolase 
and 4-carboxymuconolactone 
decarboxylase 


transcriptional regulator 


3-oxoadipate enoMactone hydrolase 
and 4-carboxymuconolactone 
decarboxylase 




3-carboxy-cis.cis-muconate 
cycloisomerase 


protocatechuate dioxygenase alpha 
subunit 


protocatechuate dioxygenase beta 
subunit 


hypothetical protein 


muconolactone isomerase 




muconate cycloisomerase 1 




catechol 1.2-dloxyaenase 


> 


toluate 1.2 dioxygenase subunit 


Matched 
length 
(aa) 


rsi 


CO 

o 




CO 

in 

CM 


m 

CM 
CO 


in 




CO 


CM 


CM 


CO 

CN 


CM 

a> 




CM 

CO 




m 

00 
CN 




CO 


Similarity 
{%) 


to 

CO 


71.9 




76.6 


43.0 ■ 


89.6 




63.4 


70.6 


91.2 


48.7 


81.5 




84.7 




88.4 




85.6 


Identity 
(%) 


58.2 


44.8 




50-8 


23.6 


78.3 




39.8 


49.5 


74.7 


26.4 


54.4 




60.8 




72.3 




62.2 


Homologous gene 


Rhodococcus opacus 1CP pcaR 


Ralstonia eutropha bktB 




Rhodococcus opacus pcaL 


Streptomyces coelicolor A3(2) 
SCM1.10 


Rhodococcus opacus pcaL 




Rhodococcus opacus pcaB 


Rhodococcus opacus pcaG 


Rhodococcus opacus pcaH 


Mycobacterium tuberculosis 
H37RV Rv0336 


Mycobacterium tuberculosis 
catC 




Rhodococcus opacus 1CP catB 




Rhodococcus rhodochrous catA 




Pseudomonas putida plasmid 
pDK1 xylX 


db Match 


LL 
CM 

<o 

00 

o 

CM 

a. 


prf:2411305D 




UJ 

^ 

CM 
CO 
CO 
O 

-^r 

CM 

<x 


o 
o 

CO 
Q. 
O) 


Ui 
CM 

S 

o 

CM 

t: 

Q. 




a 

CM 
CO 
00 

o 

CM 

t* 
CL 


o 

CO 
00 

o 

CM 

t: 

Q. 


m 

CM 
CO 
CO 

o 

CM 

t: 

Q. 


pirG70506 


CD 

CO 
CO 
CO 

in 
m 

CN 

CL 




sp:CATB_RHOOP 




prf.2503218A 




CO 

CO 

at 




CM 
O) 


1224 


CM 


CO 
lO 


2061 


CO 
CO 
CO 


00 
CO 


1116 


CM 
CD 


O 
CT) 
CO 


1164 


o> 

CM 




1119 


CO 

o 

CO 


m 
m 
oo 




1470 


Terminal 
. (nt) 


2534182 


2535424 


2534257 


2536182 


2538256 


00 

CM 
CO 
CO 

m 

CM 


2540230 


2538616 


2539709 


2540335 


2541187 


2542512 


2543813 


2542818 


2544867 


2544022 


2544928 


2546784 


Initiaf 
(nt) 


2533391 


2534201 


2535168 


2535430 


2536196 


2538613 


2539553 


2539731 


2540320 


2541024 


2542350 


2542802 


2543043 


2543936 


2544262 


2544876 


2545068 


2545315 


SEQ 
NO. 
(B.a.) 


6123 


6124 


6125 


6126 


6127! 


6128 


6129 


6130 


6131 


6132 


6133 


6134 


6135 


6136 


6137 


6138 


6139 


6140 


SEQ 
NO. 
(DNA) 


2623 


2624 


2625 


2626 


2627 


2628 


2629 


2630 


2631 


2632 


2633 


2634 


2635 


2636 


2637 


CO 
CO 
CO 
IN 


cn 

CO 
(0 
CM 


2640 
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5 
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Function 


totuate 1.2 dioxygenase subunit 


toluate 1,2 dioxygenase subunit 


1 ,2-dihydroxycyclohexa-3.5-diene 
carboxytate dehydrogenase 


regulator of LuxR family with ATP- 
binding site 


transmembrane transport protein or 
4-hydroxyben2oate transporter 


benzoate membrane transport 
protein 


ATP-dependent CIp protease 
proteolytic subunit 2 


ATP-dependent CIp protease 
proteolytic subunrt 1 


hypothetical protein 


trigger factor (prolyl Isomerase) 
(chaperone protein) 


hypothetical protein 


penicillin-binding protein 


hypothetical protein 




■ transposase 




hypothetical protein 


transposase 


15 


Matched 
length 
(aa) 


CO 


CN 
CO 


1 

IIZ 


O) 


m 

CO 


00 
00 
CO 


h- 
O) 

1 — 


CO 
O) 


CN 


T— 


o 

CO 


CO 
CO 
CO 


to 




CN 




to 

CO 


in 


20 


Similarity 

(%) 1 


83.2 


81.0 


61.4 


48.6 


CO 


66,2 


88.3 


85.9 


71.4 


66.4 


63,1 


50.9 


CO 

od 
tn 




73.2 




62.9 


78.7 




Identity 
(%) 


60.3 


51.5 


30.7 


CO 

CO 
CN 


31.3 


29.9 


u> 

Ol 
CO 


62.1 


as 

CM 


32,1 


32.5 


25.3 


27.8 




54,2 




57.1 


N- 
O 

to 


25 

a> 

C 

c 

30 ^ 
.a 

35 
40 


Homologous gene 


Pseudomonas putida plasmid 
pDKI xylY 


Pseudomonas putida plasmid 
pDK1 xyl2 


Pseudomonas putida plasmid 
pDK1 xylL ! 


Rhodococcus erythropolis thcG 


Acinetobacter calcoaceticus 
pcaK 


Acinetobacter calcoaceticus 
benE 


streptomyces coelicolor Ml 45 
c!pP2 


streptomyces coelicolor Ml 45 
clpP1 


Sulfolobus islandicus ORF154 


Bacillus subtilis 168 tig 


! Streptomyces coelicolor A3{2) 
isCD25.17 


Nocardia lactamdurans LC411 
pbp 


Mus musculus Moal 




Corynebacterium striatum 0RF1 




Corynebacterium striatum 0RF1 


Corynebacterium striatum ORF1 


db Match 


gp:AF134348_2 


CO 
CO 
CO 
CO 
U- 

< 

CL 
Ui 


gp:AF 134348^4 


o' 

in 
cn 

UJ 

o:: 


< 
o 

o 

5 
s 

Q. 

CL 
lA 


< 
o 

I 

z 

UJ 

CD 

Q. 
U) 


m 

CO 
CO 

o 
u. 

< 

iCL 


1 

m 

GO 
00 

T— 

o 
u. 

< 

CL 
Ol 


gp:SIS243537_4 


sp:TIG_BACSU 


gp:SCD25_17 


o 
o 
z 

^' 

a. 

CD 

CL 

d. 
tn 


< 

CM 

P> 

O 
CO 
CM 

a 




prft2513302C 




O 

S 
CO 

CO 

^ 
to 

CN 
t 


o 

CN 
O 
CO 
CO 

in 

CN 

t: 

CL 






CN 

cn 


1536 


GO 
CN 
00 


2685 


1380 


1242 


CM 
CO 


CO 

o 

CO 


o 
to 


1347 


in 


to 
o> 


CO 

to 


as 

CN 


oo 

CO 


o 
to 


CD 
(N 


CO 
CN 


45 


Temiinal 
(nt) 


2547318 


2548868 


2549695 


2552455 


' 2553942 


2555267 


2555317 


2555978 


^ 2556748 


2556760 


2559103 


2560131 


2560586 


2561363 


I 2561483 


! 2562242 


2561990 


2562078 


50 


Initial 
(nt) 


2546827 


2547333 


2548868 


2549771 


2552563 


2554026 


2555940 


2556580 


2556599 


2558106 


2558609 


2559157 


2560131 


2561115 


2561920 


2562093 


2562115 


2562341 


1 
1 


CO 2 


6141 


6142 


6143 


6144 


6145 


6146 


6147 


6148 


6149 


6150 


6151 


6152 


CO 
CO I 


6154 


6155, 


6156 


16157 


6158 


55 


SEQ 1 
NO. ! 
(ONA) 


2641 


2642 


2643 j 


CO 
CN 


2645 


2646 i 


2647 


2648 


2649 


2650 


2651 


2652 


2653 


2654 


2655 


2656 


2657 


CN 
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c 



0) 
X2 



Function 






galactose-6-phosphate isomerase 


hypothetical protein 


hypothetical protein 


aminopeptidase N | 


hypothetical protein 








phytoene desaturase 






phytoene dehydrogenase 


phytoene synthase 


multidrug resistance transporter 




ABC transporter ATP-binding protein 


dipeptide transport system 
permease protein 


nickel transport system pemiease 
protein 




Matched 
length 
(aa) 






o 


00 


Oi 


o 

Ui 
GO 


00 

in 

CO 








T- 






T- 

co 

CO 


o 
a> 

CM 








CD 
GO 
CM 


CD 

CO 




•c 




















































CD 


m 
















00 


CO 






(O 


CO 


o 












00 

in 


O 
00 


c> 
1^ 


00 

in 








CO 






CO 
CO 


CO* 

in 








CO 


CM* 
CD 


































































































o 


CM 


GO 


in 










in 






CM 




GO 




CO 


CO 


CM 




Is 






o 


(d 

CM 


CO 

in 




in 

CM 








CO 






CO 


CO 


th 

CM 






fid 

CO 


CO 
CO 




Homologous gene 






staphylococcus aureus NCTC 
8325^ lacB 


Bacillus acidopullulytlcus ORF2 


Mycobacterium tuberculosis 
H37RvRv2466c 


Streptomyces lividans pepN 


Borrelia burgdorferi BB0852 








Brevibacterium linens ATCC 
9175 crti 






Myxococcus xanthus DK1050 
carA2 


Streptomyces griseus JA3g33 
crtB 


Listeria monocytogenes IltB 




Synechococcus elongatus 
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Archaeoglobus fuigidus AF0251 


Corynebacterium glutamicum 
ATCC 13032 nrdE 




Rickettsia prowazekii 


Bacillus subtilis 168 nadE 






Synechocystis sp. PCC6803 
slr1563 
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Function 


hypothetical membrane protein 


hypothetical membrane protein 


hypothetical protein 


transposase (IS1676) 


major secreted protein PS1 protein 
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transposase (IS1 676) 




proton/sodium-glutamate symport 
protein 




ABC transporter 
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33.0 




45.4 


60.0 


71.0 




28.1 
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Table 1 (continued) 


Homologous gene 


Mycobacterium tuberculosis 
H37RV Rv3069 


Helicobacter pylori J99jhp1 146 


Bacillus subtilis 168 ycsl 


Rhodococcus erythropolis 


Corynebacterium glutamicum 
(Brevibacterium fiavum) ATCC 
17965 cspl 
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Streptomyces coelicolor A3(2) 
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Chlamydophila pneumoniae 
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2703356 
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2690150 
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2690773 


2691689 


2693299 
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2702466 
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2704314 
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2709878 


2710637 
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0* 
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6295| 
62961 
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6302 
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6304 
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6307 


6308 
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NO. 
(DNA) 
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Function 


methyltransferase 


hypothetical protein 


hypothetical protein 




UDP-N-acetylglucosamine 1- 
carboxyvinyltransferase 


hypothetical protein 


transcriptional regulator 




cysteine synthase 


O-acetylserine synthase 1 


hypothetical protein 


succinyl-CoA synthetase alpha 
chain 


hypothetical protein 


succinyl-CoA synthetase beta chain 




frenolicin gene E product 




succinyl-CoA coenzyme A 
transferase 


transcriptional regulator 
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Homologous gene 


Mycobacterium tuberculosis 
H37RV RV0089 


Chlamydia pneumoniae 


Chlamydia muridarum Nigg 
TC0129 




Adnetobacter calcoaceticus 
NCIB 8250 murA 


Mycobacterium tuberculosis 
H37RV Rv1314c 


streptomyces coeiicolor A3(2) 
SC2G5.15C 




Bacillus subtilis 168 cysK 


Azotobactervinelandii cysE2 


Deinococcus radiodurans R1 
DR1844 


Coxiella burnetii Nine Mile Ph 1 
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Aeropyrum pernix K1 APE1069 
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Azospirillum brasilense ATCC 
29145 ntrC 
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2713842 


2717993 


2718436. 


2720319 


2720385 


2721295 


2722857 , 


2723609 


2723770 


2724478 


2725843 


2725384 


2726786 


2727399 


2728207 


2729378 


2732518 
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(nt) 


2711850 


2713181 


2713702 


2718187] 


2719689 


2719750 


2721227 


2721702 


2721934 


2723064 


2724057 


2725359 


2725619 


2726577 


2727145 
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2729025 
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2731376 
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Function 




phosphate transport system 
regulatory protein 


phosphate-specific transport 
component 


phosphate ABC transport system 
permease protein 


phosphate ABC transport system 
permease protein 


phosphate-binding protein S-3 
precursor 


acetyltransferase 




hypothetical protein 


hypothetical protein 


branched-chain amino acid 
aminotransferase 


hypothetical protein 


hypothetical protein 


5*.phosphoribosyl-5-aminolmtda2ole 
synthetase . 


amidophosphoribosyl transferase 
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length 
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56.0 
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81.0 
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94.2 
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81.0 
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c 
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Homologous gene 




Mycobacterium tuberculosis 
H37RV Rv0821cphoY-2 


Pseudomonas aeruginosa pstB 


Mycobacterium tuberculosis 
H37RV Rv0830 pstA1 


Mycobacterium tuberculosis 
H37RV Rv0829 pstC2 


Mycobacterium tuberculosis 
H37RV phoS2 


Streptomyces coeficolor A3{2) 
SCD84.18C 




Baditus subtilis 168 bmrU 


Mycobacterium tuberculosis 
H37RV Rv0813c 


Solanum tuberosum BCAT2 


Corynebacterium 
ammoniagenes ATCC 6872 
ORF4 


Mycobacterium tuberculosis 
H37RV Rv0810c 


Corynebacterium 
ammoniagenes ATCC 6872 
purM 


Corynebacterium 
ammoniagenes ATCC 6872 
purF 
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2734351 


2735184 
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hypothetical protein 
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hypothetical membrane prol 
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5'-pho5phohbosyl-N- 
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Table 1 (continued) 


Homologous gene 


Mycobacterium tuberculosis 
H37RV Rv0807 


Corynebacterium 
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Function 


pyruvate oxidase 


multidrug efflux protein 


transcriptional regulator 
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Table 1 (continued) 


Homologous gene 


Escherichia coli K12dapA 


Streptomyces coelicolor A3(2) 
SC6E1 0.20c glk 


Clostridium perfringens NCTC 
8798 nanE 




Micromonospora viridifaciens 
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Bacillus firmus OF4 dppA 
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virulence factor 


CIpC adenosine triphosphatase / 
ATP-binding proteinase 


inosine monophosphate 
dehydrogenase 


transcription factor 


phenol 2-monooxygenase 










lincomycin resistance protein 


hypothetical protein 
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hypothetical membrane protein 
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hypothetical membrane protein 


hypothetical membrane protein 


propionyl-CoA carboxylase complex 
B subunit 


polyketide synthase 


acyl-CoA synthase 


hypothetical protein 




major secreted protein PS1 protein 
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hypothetical membrane protein 


nodulation protein 
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Mycobacterium tuberculosis 
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Streptomyces coelicolor A3(2) 
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dimethytanitine monooxygenase (N- 
oxide-forming) 




UDP-galactopyranose mutase 


hypothetical protein 


glycerol kinase 


hypothetical protein 


acyltransferase 


seryl-tRNA synthetase 


transcriptional regulator. GntR family 
or fatty acyt-responsive regulator 


hypothetical protein 
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Table 1 (continued) 


Homologous gene 






Sus scrofa fmo1 
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Table 1 (continued) 


Homologous gene 


Streptomyces coelicolor A3{2) 
SC6G4.33 








Streptomyces lavendulae 
ORF372 


Saccharomyces cerevisiae 
S288C YIR019C stal 




Bacillus subtilis gtpQ 


Bacillus subtilis gntP 






Coryhebacterium glutamicum 
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Function 






two-component system sensor 
histtdine kinase 


hypothetical protein 


hypothetical protein 


stage III sporulation protein 


transcriptional repressor 


transglycosylase-associated protein 


hypothetical protein 


hypothetical protein 


RNA pseudouridylate synthase 


1 hypothetical protein 


hypothetical protein 




bacterial regulatory protein. gntR 
family or glc operon transcriptional 
activator 


hypothetical protein 


hypothetical protein 
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length 
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Table 1 (continued) 


Homologous gene 






Corynebacterium diphtheriae 
ChrS 


Streptomyces coelicolor A3(2) 
SCH69.22C 


Streptomyces coelicolor A3(2) 
SCH69.20C 


Bacillus subtilis spolllJ 


Mycobacterium tuberculosis 
H37RV Rv3173c 


Escherichia coli K12 MG1655 
tag1 1 


Mycobacterium tuberculosis ' 
H37RvRv2005c 


Escherichia coli K12 MG1655 " 
yhbW 


Chlorobium vibrioforme ybcS 


Chlamydia pneumoniae 


Chlamydia muridarum Nigg 
TC0129 




Escherichia coli K12 MG1655 
glcC 


streptomyces coelicolor 
SC4G6,31c 


Mycobacterium tuberculosis 
H37RV Rv2744c 
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methyltransferase 


nodulin 21 -related protein 








transposon tn501 resolvase 1 




ferredoxin precursor 1 


hypothetical protein 1 


transposase 


transposase protein fragment 
TnpNC 




glyceraldehyde-3-phosphate 
dehydrogenase (pseudogene) 


lipoprotein 


copper/potassium-transporting 
ATPase B or cation transporting 
ATPase (E1-E2 family) 
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Streptomyces coelicolor A3(2) 


Corynebacterium glutamicum 
Tnp1673 


Corynebacterium glutamicum 




Pyrococcus woesei gap 


Synechocystis sp. PCC6803 
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Archaeoglobus fulgidus AF0152 
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Function 




two-component system sensor 
histidine kinase 




two-component response regulator 
or alkaline phosphatase synthesis 
transcriptional regulatory protein 




laccase or copper resistance protein 
precursor A 


thioi'.disulfide interchange protein 
(cytochrome c biogenesis protein) 


quinone oxidoreductase 
(NADPH.quinone reductase)(setar 
crystallin) 
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hypothetical protein 
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Homologous gene 




Escherichia coli K12 baeS 




Bacillus subtilis phoP 
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Function 


iron(lll) dicitrate-binding periplasmic 
protein precursor or iron(lll) dicltrate 
transport system permease protein 


mrtochondrial respiratory function 
protein or zinc-binding 
dehydrogenase or NADPH quinone 
oxidoreductase 






phosphomethylpyrimidine kinase 




mercuric ion-binding protein or 
heavy-metal-associated domain 
containing protein 


branched-chain amino acid transport 1 


branched-chain amino acid transport 


hypotheticat protein 


tRNA nucleotidyltransferase 1 


mutator mutT protein 




hypothetical membrane protein 


hypothetical membrane protein 




RNA polymerase sigma-H factor or 
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Table 1 (continued) 
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Bacillus subtUis thiD 




Bacillus subtilis yvgY 
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thioredoxin ch2, M-type 


N-acetylmuramoyl-L-alanine 
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hypothetical protein 
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Table 1 (continued) 


Homologous gene 
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446521 
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3301358 
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3306532 1 
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Function 


elongation factor Tu 


preprotein translocase secY suburt 


isocitrate dehydrogenase 
(oxalosuccinatedecarboxylase) 


acyl-CoA carboxylase or biotin- 
binding protein 


citrate synthase 


putative binding protein or peptidyl- 
prolyl cis-trans isomerase 


glycine betaine transporter 


hypothetical membrane protein 


L-lysine permease 


aromatic amino acid permease 


hypothetical protein 


succinyl diaminopimelate 
desuccinytase 


proline transport system 


arginyl-tRNA synthetase 
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Corynebacterium glutamicum 
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Corynebacterium glutamicum 
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Corynebacterium glutamicum 
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Corynebacterium glutamicum 
ATCC 13032 gItA 


Corynebacterium glutamicum 
ATCC 13032 fkbA 


Corynebacterium glutamicum 
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Corynebacterium glutamicum 
ATCC 13032 orf2 


Corynebacterium glutamicum 
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ATCC 13032 aroP 


Corynebacterium glutamicum 
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Corynebacterium glutamicum 
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Corynebacterium glutamicum 
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diaminopimelate (DAP) 
decarboxylase (meso- 
diaminopimelate decarboxylase) 


homoserine dehydrogenase 


homoserine kinase 


ion channel subunit 


lysine exporter protein 


lysine export regulator protein 


acetohydroxy acid synthase, large 
subunit 


acetohydroxy acid synthase, small 
subunit 


acetohydroxy add isomeroreductase 


3-isopropylmatate dehydrogenase 


PTS system, phosphoenolpyruvate 
sugar phosphotransferase 
(mannose and glucose transport) 


acetylglutamate kinase 


ornithine carbamoyltransferase 


arginine repressor 
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Corynebacterium glutamicum 
AS019 ATCC 13059 lysA 


Corynebacterium glutamicum 
AS019 ATCC 13059 horn 


Corynebacterium glutamicum 
AS019ATCC13059thrB 


Corynebacterium glutamicum 
R127orf3 


Corynebacterium glutamicum 
R127 lysE 


Corynebacterium glutamicum 
R127 lysG 


Corynebacterium glutamicum 
ATCC 13032ilvB 
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ATCC 13032 ilvN 
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Corynebacterium glutamicum 
KCTC1445ptsM 


Corynebacterium glutamicum 
ATCC13032 argB 


Corynebacterium glutamicum 
ATCC 13032 argF 


Corynebacterium glutamicum 
AS019argR 


db Match 


-1 
O 

ct 
o 

o 

o 

Q 
d 


o 

X 
Q 

b. 

(A 


-J 
O 
a: 

O 

CO 

i 

d. 
to 


gsp:W37716 


sp:LYSE_CORGL 


-J 
o 

I' 

CO 

>i 

o. 


-1 
O 

o 

3 

d. 


CO 

i 

CD 

l: 
'5. 


pir.C48648 


lsp:LEU3.CORGL 


CO 
CM 

o 

CM 

f 

a. 


-J 
o 
q: 
O 
o 

m 

< 

d. 

Ui 


-J 
o 

i 

d. 

tn 


gp:AF041436J 




1335 


1335 


1^ 

CM 

o> 


cv 

CO 


CO 

o 


o 

£o 


1878 


to 
tn 


1014 


!l020 


o 

CM 


00 


tn 


CO 

in 


Terminal 
(nt) 


1241263 


1243841 


1244781 


1328243 


1328246 

1 


1329884 


1340008 


1340540 


1341737 


1354508 


1425265 


1467372 


1469521 


1470040 


Initial 
(nt) 


1239929 


1242507 


1243855 


1327617 


1328953 


1329015 


1338131 


1 1340025 


1340724 


1353489 


1423217 


1466491 


1468565 


1469528 


Sis 

CO 2 S 


6951 


6952 


6953 


6954 


6955 


6956 


6957 


6958 


6959 


0969 


6961 


6962 


6963 


6964 


HO 2 Q 1 


3451 


3452 


3453 


3454 


3455 


3456 


3457 


3458 


3459 


3460 


3461 


3462 


3463 


CD 
CO 



EP1 108 790 A2 



10 



15 



20 



25 



30 



T3 

a> 
3 



a> 



35 



40 



45 



50 



55 



o 

c 
u. 



1 



-2 5" 



IE 

•o 



3 
O 

o> 
o 
o 

E 
o 
X 



o 



CO 

S ^ 

0) 



I? 



So ? 

W2 S 



w z o 



1^ 



o 
o 



o 
o 



11 

o H 



o 

GO 



O 



o 



CO 



in 
in 



in 



m 

CO 



01 

tf 
II 

o c 

JZ >s 

a. CL 



o 

6 
o 



o 
o 



£ 
E 

(0 

o 
E 

|uj 

O tn 
(0 
J3 

O o> 

C 

£ro 

O CO 



o 

CD 
GO 
O 



Q. 



<0 



in 

(O 

oo 
m 



in 

CM 

CD 
CO 

m 



CO 



0) 
(A 

n 

t 

CO 

■0 



CM 



o 
o 



o 
o 



5 8 

o H- 



CM 
CO 

rs- 
r*^ 
O 
o 
—I 
CD 
O 

CL 



CD 
CO 
O 



CO 
CM 

CD 



00 
O 
CN 

in 

CD 



(O 
CD 



CO 
CO 

r> 



CD 



JZ 
JZ 

c" 

0) 



CL 

e 

3 



ii 

(0 (0 



tn 



o 
o 



o 
o 
o 



E 

3 

£ 

fi 

0) CN 
0> ^ 

O H 



o 



8 

CL 
O) 



CD 

m 

CO 



CO 
CO 
CN 

in 

CD 



CO 
CN 
CD 
CO 

CO 



CO 
(O 
O) 
CD 



O 
O 



o 

O 



= w 



^1 

CN 
CO 

r*. 

o 
o 

O 
O 

Q. 
C3) 



CN 



s 

CD 



CN 
CO 



00 
CO 

CO 



cn 

CO 

cn 

CO 



O) 
CO 

CO 



Q> 

t 

e 
8 



I 

o 
c 

0) 

o 

M 

o 



CJ) 



o 
o 
o 



o 
o 



E 

3 

u 
E 

CO 

i ^ 

0) CN 

£^ 

o H 



;5 

CD 

Si 

o 
in 

f 

Q. 



m 

CN 



00 

cn 

CO 



CO 



o 

00 
CO 



o 
cn 

CD 



o 



to 

OB 

M 
O 



to 
£ 

- ^ 



o o 
c -c 

4) CL 



o 
o 



o 
o 



£ 

3 
O 

£ 
I 

O) 

£ 

3 

•c 

J3 CO 

I- 

O (0 

o< 



o 
o 

CO 
CN 

i 

CL 



O 

cn 

CM 



00 

s 

o 

CM 



CJ) 
CD 



cn 



in 

CO 

0) 
o 

3 
C 
O 

•o 
c 

0> 

c 
o 



CM 

cn 

to 



o 
o 
o 



o 
o 
o 



£ 

3 

JO 

£ 

CO 

^ o 
0) CN 

o h- 

o < 



to 
in 
CD 



CD 
CO 



m 

CO 

cn 

CN 
CO 
CO 



8 
S 

CD 

CO 



CD 



£ 

o 
a. 

Ii 

og 

O CO 
U1 ^ 



CO 
CO 



o 
o 



o 
o 



E 

CO 

ECD 
C7) 



CD C3) 

o H 

o< 



Q 

CO 
00 
CN 



tf 



CO 

CD 



s 

T— 

CN 

8 



00 
O 
CN 
O 
CN 



cn 

CD 



cn 



c 
*5 
"S 
cx 

o> 
c 

c 
15 

i 

£ 
I 

OJ 



8 



o 
o 



o 
o 



•i °' 

o \- 



o 
o 

3 

09 



in 

00 



o 
cn 

CO 

o 

CN 



o 

CN 

s 
s 

CN 



0) 

"5 



o 
o 



o 
o 



u 
E 

CO 
O) 

I 

o (O 
o < 



O 
q: 
o 
o 

<' 

o 
ai 
q: 

iCL 



00 

CN 



(7) 
00 

cn 

CO 
CD 

o 

CN 



CO 

s 

CN 



1^ 

at 

CD 



in 

CD 
CO 



1^ 



c 



CO 

c 
o 

o- 



O 
CO 



o 
o 



o 
o 



E 

E c 
o 

°'^< 

E c « 

*^ is 
0) 'C c 

tsis 
If o 
5 £5 



Ui 

cr 

GO 

i' 

a. 



cn 
o 



00 
CN 
O) 

o 

CN 



cn 

CO 



o 

CN 



CO 
CD 



CO 



0) 
3 



O 
O 

I 



00 
CN 



O 

o 



o 
o 



£ 

3 A) 

"Is 

5 E <0 
3 C -O 

CO u ^ 

XI CO V- 
£*0) o 



O 
o 

m' 



C3 



00 

o 

CN 



CO 
CJ) 

o 

CN 



CO 



8 

(0 
cn 

CO 

c 
73 



o 
o 



o 
o 
o 



o 
o 



£ 

3 
O 

E 

CO 

B 
E 

3 

*C 

<u 

0) j 

C*CM 
O T- 

o a 



CD 

cn 

CN 



o 

CL 
CJ) 



s 

m 



CO 
CO 
CO 



CO 
CO 

CO 



CO 

cn 

CO 



CO 



EP1 108790 A2 



5 
10 


Function 


uridilylyitransferase, uridilylyl- 
removing enzyme 


nitrogen regulatory protein P-ll 


ammonium transporter 


glutamate dehydrogenase (NADP-*-) 


pyruvate kinase 


glucoklnase 


glutamine synthetase 


threonine synthase 


ectoine/proline/glycine betalne 
carrier 


matate synthase 


isocitrate lyase 


glutamate 5-kinase 


cystathionine gamma-synthase 


ribonucleotide reductase 
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Table 1 (continued) 


Homologous gene 


Corynebacterium glutamicum 
ATCC 13032 gInD 


Corynebacterium glutamicum 
ATCC 13032 gInB 


Corynebacterium glutamicum 
ATCC 13032 amtP 


Corynebacterium glutamicum 
ATCC 17965 gdhA 
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ATCC 13032 glk 
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ATCC 13032 gInA 
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Corynebacterium glutamicum 
ATCC 13032 aceB 
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Example 2 

Determination of effectiv mutation site 

(1) Identification of mutation sit based on the comparison of the gen nucleotide sequenc of lysine-producing B-6 
strain with that of wild type strain ATCC 13032 

[0374] Corynebacterium glutamicum B-6. which is resistant to S-(2-aminoethyl)cysteine (AEC), rifampicin, strepto- 
mycin and 6-azauracil, is a lysine-producing mutant having been mutated and bred by subjecting the wild type ATCC 
13032 strain to multiple rounds of random mutagenesis with a mutagen, N-methyl-N' -nitro-N-hitrosoguanidine (NTG) 
and screening {AppL MicrobioL BiotechnoL. 32: 269-273 (1989)). First, the nucleotide sequences of genes derived 
from the B-6 strain and considered to relate to the lysine production were detentiined by a method similar to the abov . 
The genes relating to the lysine production include lysE and ly$G\Nh\ch are lysine-excreting genes; ddh, dapA, horn 
and l^C (encoding diaminopimelate dehydrogenase, dihydropicolinate synthase, homoserine dehydrogenase and 
aspartokinase, respectively) which are lysine-biosynthetic genes; and pyc and zivf (encoding pymvate carboxylas 
and glucose-6-phosphate dehydrogenase, respectively) which are glucose-metabolizing genes. The nucleotide se- 
quences of the genes derived from the production strain were compared with the corresponding nucleotide sequences 
of the ATCC 13032 strain genome represented by SEQ ID N0S:1 to 3501 and analyzed. As a result, mutation points 
were observed in many genes. For example, no mutation site was observed in lysE, lysG, ddh, dapA, and the like, 
whereas amino acid replacement mutations were found in horn, lysC, pyc, zwf, and the like. Among these mutation 
points, those which are considered to contribute to the production were extracted on the basis of known biochemical 
or genetic infomnation. Among the mutation points thus extracted, a mutation. Val59Ala. in horn and a mutation, 
Pro468Ser, in pyc were evaluated whether or not the mutations were effective according to the following method. 

(2) Evaluation of mutation, Val59Ala. in ham and mutation, Pro458Ser, in pyc 

[0375] It is known that a mutation in horn inducing requirement or partial requirement for homoserine imparts lysin 
productivity to a wild type strain {Amino Acid Fermentation, ed. by Hiroshi Alda ef a/., Japan Scientific Societies Press). 
However, the relationship between the mutation, VaI59Aia. in hom and lysine production is not known. It can be x- 
amined whether or not the mutation, Val59A!a, in hom is an effective mutation by introducing the mutation to the wild 
type strain and examining the lysine productivity of the resulting strain. On the.other hand, it can be examined whether 
or not the mutation. Pro468Ser, in pyc is effective by introducing this mutation into a lysine-producing strain which has 
a deregulated lysine-bloxynthetic pathway and is free from the pyc mutation, and comparing the lysine productivity of 
the resulting strain with the parent strain. As such a lysine-producing bacterium. No. 58 strain (FERM BP-7134) was 
selected (hereinafter referred to the "lysine-producing No. 68 strain" or the "No. 58 strain"). Based on the above, it was 
detemiined that the mutation. Val59Ala, in hom and the mutation, Pro458Ser, in /3yc were introduced into the wild typ 
strain of Corynebacterium glutamicum ATCC 13032 (hereinafter referred to as the "wild type ATCC 13032 strain" or 
the "ATCC 13032 strain") and the lysine-producing No, 58 strain, respectively, using the gene replacement method. A 
plasmid vector pCES30 for the gene replacement for the introduction was constructed by the following method. 
[0376] A plasmid vector pCE53 having a kanamycin-resistant gene and being capable of autonomously replicating 
in Corynefomn bacteria (MoL Gen, Genet, 196: 175-178 (1984)) and a plasmid pM0B3 (ATCC 77282) containing a 
levansucrase gene (sacfl) of Bacillus subtilis (Molecular Microbiology, &, 1195-1204 (1992)) were each digested with 
Psfl. Then, after agarose gel electrophoresis, a pCE53 fragment and a 2.6 kb DNA fragment containing sacS were 
each extracted and purified using GENECLEAN Kit (manufactured by BIO 101). The pCE53 fragment and the 2.6 kb 
DNA fragment were Ifgated using Ligation Kit ver. 2 (manufactured by Takara Shuzo), introduced into the ATCC 13032 
strain by the electroporation method {FEMS Microbiology Letters, 65: 299 (1989)), and cultured on BYG agar medium 
(medium prepared by adding 1 0 g of glucose, 20 g of peptone (manufactured by Kyokuto Pharmaceutical), 5 g of yeast 
extract (manufactured by Difco), and 16 g of Bactoagar (manufactured by Difco) to 1 liter of water, and adjusting its 
pH to 7.2) containing 25 ng/ml kanamycin at 30^*0 for 2 days to obtain a transfonnant acquiring kanamycin-resistanc . 
As a result of digestion analysis with restriction enzymes, it was confirmed that a plasmid extracted from the resulting 
transfonmant by the alkali SDS method had a structure in which the 2.6 kb DNA fragment had been insert d into the 
PsH site of pCE53. This plasmid was named pCES30. 

[0377] Next, two genes having a mutation point, hom and pyc, were amplified by PCR, and inserted into pCES30 
according^to the TA cloning method (Bio Experiment Illustrated vol. 3. published by Shujunsha). Specifically, pCES30 
was digested with BamHI (manufactured by Takara Shuzo), subject d to an agarose gel electrophoresis, and extracted 
and purified using GENECLEAN Kit (manufactured by BI0 1 01 ). The both ends of the resulting pCES30 fragment w re 
blunted with DNA Blunting Kit (manufactured by Takara Shuzo) according to the attach d protocol. The blunt-ended 
pCES30 fragment was concentrated by xtraction with phenol/chloroform and precipitation with ethanol, and allowed 
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to react in the presence of Taq polymerase (manufactured by Roche Diagnostics) and dTTP at 70^*0 for 2 hours so 
that a nucleotide, thymine (T), was added to th 3'-end to prepare a T vector of pCES30. 

[0378] Separately, chromosomal DNA was prepar d from the lysine-producing B-6 strain according to the method 
of Saito et al. (Biochem, Biophys. Acta, 72. 619 (1963)). Using the chromosomal DNA as a template, PGR was carried 
out with Pfu turbo DNA polymelase (manufactured by Stratagen ). In the mutated horn gen , the DNAs having the 
nucleotide sequences represented by SEQ ID NOS:7002 and 7003 were used as the primer set. In the mutated pyc 
gene, the DNAs having the nucleotide sequences represented by SEQ ID NOS:7004 and 7005 were used as the primer 
set The resulting PGR product was subjected to agarose gel electrophoresis, and extracted and purified using GENE- 
GLEAN Kit (manufactured by BI0 1 01 ). Then, the PGR product was allowed to react in the presence of Taq polymerase 
(manufactured by Roche Diagnostics) and dATP at 72''C for 10 minutes so that a nucleotide, adenine (A), was added 
tothe3'-end. 

[0379] The above pGES30 T vector fragment and the mutated horn gene (1 .7 kb) or mutated pyc gene (3.6 kb) to 
which the nucleotide A had been added of the PGR product were concentrated by extraction with phenol/chloroform 
and precipitation with ethanol, and then llgated using Ligation Kit ver. 2. The ligation products were introduced into the 
ATGG 13032 strain according to the electroporation method, and cultured on BYG agar medium containing 25 fig/ml 
kanamycin at SO^C for 2 days to obtain kanamycin-resistant transfomiants. Each of the resulting transformants was 
cultured overnight in BYG liquid medium containing 25 [ig/ml kanamycin, and a plasmid was extracted from the culturing 
solution medium according to the alkali SDS method. As a result of digestion analysis using restriction enzymes, it was 
confimied that the plasmid had a structure in which the 1 .7 kb or 3.6 kb DNA fragment had been inserted into pGES30. 
The plasmids thus constructed were named respectively pGhom59 and pGpyc458. 

[0380] The introduction of the mutations to the wild type ATGG 13032 strain and the lysine-producing No. 58 strain 
according to the gene replacement method was carried out according to the following method. Specifically, pGhom59 
and pGpyc468 were introduced to the ATGG 13032 strain and the No. 58 strain, respectively, and strains in which the 
plasmid Is integrated into the chromosomal DNA by homologous recombination were selected using the method of 
Ikeda etaL (Microbiology 144: 1863 (1998)). Then, the stains in which the second homologous recombination was 
carried out were selected by a selection method, making use of the fact that the Bacillus subtills levansucrase encoded 
by pCES30 produced a suicidal substance (J. ofBacterloL, 174: 5462 (1992)). Among the selected strains, strains in 
which the wild type horn and pyc genes possessed by the ATGG 1 3032 strain and the No. 58 strain were replaced with 
the mutated horn and pyogenes, respectively, were isolated. The method is specifically explained below. 
[0381] One strain was selected from the transformants containing the plasmid. pGhom59 or pGpyc458. and the 
selected strain was cultured in BYG medium containing 20 jig/ml kanamycin, and pCG11 (Japanese Published Exam- 
ined Patent Application No. 91827/94) was introduced thereinto by the electroporation method. pGGII is a plasmid 
vector having a spectinomycln-resistant gene and a replication origin which Is the same as pGE53. After introduction 
of the pGGII, the strain was cultured oh BYG agar medium containing 20 [ig/ml kanamycin and 1 00 ng/ml spectinomycin 
at 30**G for 2 days to obtain both the kanamycin- and spectinomycin-resistant transformant. The chromosome of one 
strain of these transformants was examined by the Southern blotting hybridization according to the method reported 
by Ikeda et al, {Microbiology 144: 1863 (1998)). As a result, it was confirmed that pGhom59 or pGpyc458 had been 
integrated into the chromosome by the homologous recombination of the Gambell type. In such a strain, the wild typ 
and mutated horn or pyc genes are present closely on the chromosome, and the second homologous recombination 
is liable to arise therebetween. 

[0382] Each of these transformants (having been recombined once) was spread on Sue agar medium (medium 
prepared by adding 100 g of sucrose, 7 g of meat extract, 10 g of peptone, 3 g of sodium chloride, 5 g of yeast extract 
(manufactured by Difco), and 18 g of Bactoagar (manufactured by Difco) to 1 liter of water, and adjusting its pH 7.2) 
and cultured at 30**C for a day. Then the colonies thus growing were selected in each case. Since a strain in which the 
sacS gene is present converts sucrose into a suicide substrate, it cannot grow in this medium (J. BacterioL, 174: 5462 
(1 992)). On the other hand, a strain in which the sacB gene was deleted due to the second homologous recombination 
between the wild type and the mutated horn or pyc genes positioned closely to each other fomis no suicide substrate 
and, therefore, can grow in this medium. In the homologous recombination, either the wild type gene or the mutated 
gene is deleted together with the sacB gene. When the wild type is deleted together with the sacB gene, the gene 
replacement into the mutated type arises. 

[0383] Ghromosomal DNA of each the thus obtained second recombinants was prepared by the above method of 
Saito et al. PGR was canled out using Pfu turbo DNA polymerase (manufactured by Stratagene) and the attached 
buffer. In the hort) gene, DNAs having the nucleotid sequences represented by SEQ ID NOS:7002 and 7003 w re 
used as the primer set. Also, in the pyc gene was used, DNAs having the nucleotide sequences represented by SEQ 
ID NOS:70b4 and 7005 were used as the primer set. The nucleotide sequenc s of the PGR products w re determined 
by th conv ntional method so that it was judged whether the horn or pyc gene of the s cond recombinant was a wild 
type or a mutant. As a result, the second recombinant which were called HD-1 and No. 58pyc were target strains having 
the mutated horn gen and pyc gene, respectively. 
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(3) Lysin production test of HD-1 and No. 58pyc strains 

[0384] The HD-1 strain (strain obtain d by incorporating the mutation, Va!59Ala, in th horn gene into the ATCC 
13032 strain) and the No. 58pyc strain (strain obtained by incorporating the mutation, Pro458Ser, in the pyc gene into 
the iysine-producing No. 58 strain) were subj cted to a culture test in a 5 I jar fenmenter by using the ATCC 13032 
strain and the Iysine-producing No. 58 strain respectively as a control. Thus lysine production was examined. 
[0385] After culturing on BYG agar medium at 30''C for 24 hours, each strain was inoculated into 250 ml of a seed 
medium (medium prepared by adding 50 g of sucrose, 40 g of com steep liquor. 8.3 g of ammonium sulfate, 1 g of 
urea, 2 g of potassium dihydrogenphosphate, 0.83 g of magnesium sulfate heptahydrate, 10 mg of iron sulfate hep- 
tahydrate, 1 mg of copper sulfate pentahydrate, 1 0 mg of zinc sulfate heptahydrate, 1 0 mg of p-alanine, 5 mg of nicotinic 
acid, 1 .5 mg of thiamin hydrochloride, and 0.5 mg of biotin to 1 liter of water, and adjusting its pH to 7.2, then to which 
30 g of calcium carbonate had been added) contained in a 2 1 buffle-attached Erienmeyer .flask and cultured therein 
at 30''C for 12 to 1 6 hours. A total amount of the seed culturing medium was inoculated into 1 ,400 ml of a main cultur 
medium (medium prepared by adding 60 g of glucose, 20 g of com steep liquor, 25 g of ammonium chloride, 2.5 g of 
potassium dihydrogenphosphate, 0.75 g of magnesium sulfate heptahydrate, 50 mg of iron sulfate heptahydrate, 13 
mg of manganese sulfate pentahydrate, 50 mg of calcium chloride, 6.3 mg of copper sulfate pentahydrate, 1.3 mg of 
zinc sulfate heptahydrate, 5 mg of nickel chloride hexahydrate, 1.3 mg of cobalt chloride hexahydrate, 1.3 mg of am- 
monium molybdenate tetrahydrate, 14 mg of nicotinic acid, 23 mg of p-alanine, 7 mg of thiamin hydrochloride, and 
0.42 mg of biotin to 1 liter of water) contained in a 5 1 jar fermenter and cultured therein at 32**C, 1 wm and 800 rpm 
while controlling the pH to 7.0 with aqueous ammonia. When glucose in the medium had been consumed, a glucose 
feeding solution (medium prepared by adding 400 g glucose and 45 g of ammonium chloride to 1 liter of water) was 
continuously added. The addition of feeding solution was carried out at a controlled speed so as to maintain the dis- 
solved oxygen concentration within a range of 0.5 to 3 ppm. After culturing for 29 hours, the culture was temriinated. 
The cells were separated from the culture medium by centrifugation and then L-tysine hydrochloride in the supematant 
was quantified by high perfomnance liquid chromatography (HPLC). The results are shown in Table 2 below. 



Table 2 


Strain 


L-Lysine hydrochloride yield (g/l) 


ATCC 13032 


0 


HD-1 


8 


No. 58 


45 


No. 58pyc 


51 



[0386] As is apparent from the results shown in Table 2, the lysine productivity was improved by introducing the 
mutation, Val59AIa, in the horn gene or the mutation, Pro458Ser, in the pyc gene. Accordingly, it was found that the 
mutations are both effective mutations relating to the production of lysine. Strain, AHP-3, in which the mutation, 
Val59Ala, in the horn gene and the mutation, Pro458Ser, in the pyc gene have been introduced into the wild type ATCC 
13032 strain together with the mutation, Thr331lle in the lysC gene has been deposited on December 5, 2000, in 
National Institute of Bioscience and Human Technology. Agency of Industrial Science and Technology (Higashi 1-1-3, 
Tsukuba-shi, Ibaraki, Japan) as PERM BP-7382. 

Example 3 

Reconstruction of Iysine-producing strain based on genome information 

[0387] The Iysine-producing mutant B-6 strain (AppL Microbiol. Biotechnoi., 32. 269-273 (1989)), which has been 
constructed by multiple round randoni mutagenesis with NTG and screening from the wild type ATCC 13032 strain, 
produces a remartcably large amount of lysine hydrochloride when cultured in a jar at 32''C using glucose as a cari^on 
source. However, since the fermentation period is long, the production rate is less than 2. 1 g/I/h. Breedinjg to reconstitute 
only eff ctive mutations relating to the production of lysine among the stimated at least 300 mutations introduced into 
the B-6 strain in the wild type ATCC 1 3032 strain was performed. 

(1) Id ntification of mutation point and effectiv mutation by comparing th gn nucleotid sequenc of the B-6 strain 
with that of the ATCC 13032 strain 

[0388] Asd scribed abov ,th nucleotide sequences of genes derived from the B-6 strain were compared with the 
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corresponding nucleotid sequences of the ATCC 13032 strain genome represent d by SEQ ID NOS:1 to 3501 and 
analyzed to identify many mutation points accumulated in the chromosome of the B-6 str£un. Among these, a mutation, 
Val591Ala, in horn, a mutation, Thr311lle, in lysQ a mutation, Pro458Ser, in pyc and a mutation, Ala213Thr, in zwf 
were specified as effective mutations relating to the production of lysine. Breeding to reconstitute the 4 mutations in 
5 the wild type strain and for constructing of an industrially important lysine-producing strain was candied out according 
to the method shown below. 

(2) Construction of ptasmid for gene replacement having mutated gene 

10 [0389] The plasmid for gene replacement, pChom59, having the mutated horn gene and the plasmid for gene re- 
placement, pCpyc458, having the mutated pyc gene were prepared in the above Example 2(2). Plasmids for gene 
replacement having the mutated lysC and zwf were produced as described below. 

[0390] The tysC and zvuf having mutation points were amplified by PGR, and inserted into a plasmid for gene re- 
placement, pCES30, according to the TA cloning method described in Example 2(2) (Bio Experiment Illustrated, Vol. 3). 

15 [0391] Separately, chromosomal DNA was prepared from the lysine-producing B-6 strain according to the abov 
method of Saito etaL Using the chromosomal DNA as a template, PGR was canied out with Pfu turbo DNA polymerase 
(manufactured by Stratagene). In the mutated lysC gene, the DNAs having the nucleotide sequences represented by 
SEQ ID NOS:7006 and 7007 were used as the primer set. In the mutated zwf gene, the DNAs having the nucleotide 
sequences represented by SEQ ID NOS:700d and 7009 as the primer set The resulting PGR product was subjected 

20 to agarose gel electrophoresis, and extracted and purified using GENEGLEAN Kit (manufactured by BIO 101). Then, 
the PGR product was allowed to react in the presence of Taq DNA polymerase (manufactured by Roche Diagnostics) 
and dATP at 72°G for 10 minutes so that a nucleotide, adenine (A), was added to the 3'-end. 
[0392] The above pGES30 T vector fragment and the mutated iysC gene (1.5 kb) or mutated zwf gene (2.3 kb) to 
which the nucleotide A had been added of the PGR product were concentrated by extraction with phenol/chloroform 

25 and precipitation with ethanol, and then ligated using Ligation Kit ver. 2. The ligation products were introduced into the 
ATGC 13032 strain according to the electroporation method, and cultured on BYG agar medium containing 25 [ig/ml 
kanamycin at 30''G for 2 days to obtain kanamycin-resistant transformants. Each of the resulting transformants wais 
cultured overnight in BYG liquid medium containing 25 ^g/ml kanamycin, and a plasmid was extracted from the culturing 
solution medium according to the alkali SDS method. As a result of digestion analysis using restriction enzymes, it was 

^ confimied that the plasmid had a structure In which the 1 .5 kb or 2.3 kb DNA fragment had been inserted into pGES30. 
The plasmids thus constaicted were named respectively pGlysG311 and pGzwf213.. 

(3) Introduction of mutation, Thr311 lie, in iysC into one point mutant HD-1 

35 [0393] Since the one mutation point mutant HD-1 in which the mutation, VaI59Ala, in bom was introduced into th 
wild type ATGG 13032 strain had been obtained in Example 2(2), the mutation, Thr311 lie, in /ysC was introduced into 
the HD-1 strain using pGlysG311 produced in the above (2) according to the gene replacement method described in 
Example 2(2). PGR was carried out using chromosomal DNA of the resulting strain and, as the primer set, DNAs having 
the nucleotide sequences represented by SEQ ID NOS:7006 and 7007 in the same manner as in Example 2(2). As a 
result of the fact that the nucleotide sequence of the PGR product was detenmi ned in the usual manner, it was conf inn d 
that the strain which was named AHD-2 was a two point mutant having the mutated i^sCgene in addition to the mutated 
horn gene. 

(4) Introduction of mutation, Pro458Ser, in pyc into two point mutant AHD-2 

45 

[0394] The mutation, Pro458Ser, in pyc was introduced into the AHD-2 strain using the pGpyc458 produced in Ex- 
ample 2(2) by the gene replacement method described in Example 2(2). PGR was carried out using chromosomal 
DNA of the resulting strain and, as the primer set, DNAs having the nucleotide sequences represented by SEQ ID 
NOS:7004 and 7005 in the same manner as in Example 2(2). As a result of the fact that the nucleotide sequence of 
so the PGR product was detemiined in the usual manner, it was confimied that the strain which was named AHD-3 was 
a three point mutant having the mutated pyc gene in addition to the mutated ham gene and IysC gene. 

(5) Introduction of mutation. Ala213Thr, in zivf into thre point mutant AHP-3 

55 [0395] the mutation, Ala213Thr, in zwf was introduc d into the AHP-3 strain using the pGzwf458 produced in the 
above (2) by the gene replacement method described in Example 2(2). PGR was carried out using chromosomal DNA 
of the resulting strain and, as the primer s t, DNAs having th nucleotid sequences represented by SEQ ID NOS: 
7008 and 7009 in the same manner as in ExampI 2(2). As a result of the fact that the nucleotide sequence of the PGR 
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product was determined in the usual manner, it was confinned that the strain which was named APZ-4 was a four point 
mutant having the mutated zwf gene in addition to the mutated horn gen . lysC gene and pyc gene. 

(6) Lysine production test on HD-1 , AHD-2, AHP-3 and APZ-4 strains 

[0396] The HD-1, AHD-2, AHP-3 and APZ-4 strains obtained above were subjected to a culture test in a 5 I jar 
fermenter in accordance with the method of Example 2(3). 
[0397] Table 3 shows the results. 



Table 3 



Strain 


L-Lysine hydrochloride (g/l) 


Productivity (g/l/h) 


HD-1 


8 


0.3 


AHD-2 


73 


2.5 


AHP-3 


80 


2.8 


APZ-4 


86 


3.0 



[0398] Since the lysine-producing mutant B-6 strain which has been bred based on the random mutation and selection 
shows a productivity of less than 2.1 g/l/h, the APZ-4 strain showing a high productivity of 3.0 g/Uh is useful in industry. 

(7) Lysine fermentation by APZ-4 strain at high temperature 

[0399] The APZ-4 strain, which had been reconstructed by introducing 4 effective mutations into the wild type strain, 
was subjected to the culturing test in a 5 1 jar f emnenter in the same manner as in Example 2(3), except that the culturing 
temperature was changed to 40*^0. 
[0400] The results are shown in Table 4. 



Table 4 



Temperature ("C) 


L-Lysine hydrochloride (g/l) 


Productivity (g/l/h) 


32 


86 


3.0 


40 


95 


3.3 



[0401] As is apparent from the results shown in Table 4, the lysine hydrochloride titer and productivity in culturing at 
a high temperature of 40**C comparable to those at 32**C were obtained. In the mutated and bred lysine-producing B- 
6 strain constructed by repeating random mutation and selection, the growth and the lysine productivity are lower d 
at temperatures exceeding 34*'C so that lysine fennentation cannot be carried out, whereas lysine fermentation can 
be carried out using the APZ-4 strain at a high temperature of 40*^0 so that the load of cooling is greatly reduced and 
it is industrially useful. The lysine fermentation at high temperatures can be achieved by reflecting the high temperature 
adaptability inherently possessed by the wild type strain on the APZ-4 strain. 

[0402] As demonstrated in the reconstaiction of the lysine-producing strain, the present invention provides a novel 
breeding method effective for eliminating the problems in the conventional mutants and acquiring industrially advan- 
tageous strains. This methodology which reconstitutes the production strain by reconstituting the effective mutation is 
an approach which is efficiently carried out using the nucleotide sequence infomnation of the genome disclosed in the 
present invention, and its effectiveness was found for the first time in the present invention. 

Example4 

Production of DNA microarray and use thereof 

[0403] A DNA microarray was produced based on th nucleotide sequence infomnation of the ORF deduced from 
the full nucleotide sequences of Corynebacterium glutamicum ATCC 13032 using software, and genes of which ex- 
pression is fluctuated depending on th cart)on source during culturing w re searched. 

(1) Production of DNA microarray 

[0404] Chromosomal DNA was prepared from Corynebacterium glutamicum ATCC 1 3032 by the method of Saito et 
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al, ( Biochem. Biophys. Acta, 72. 619 (1963)). Bas d on 24 genes having the nucleotide sequences represented by 
SEQ ID NOS:207. 3433, 281, 3435. 3439, 765, 3445, 1226, 1229, 3448, 3451, 3453, 3456, 1743, 3470, 2132, 3476. 
3477, 3485, 3488. 3489, 3494, 3496, and 3497 from th ORFs shown in Table 1 deduced from the full genom nucle- 
otide sequence of Corynebacterium glutawicum ATCC 13032 using software and the nucleotide sequence of rabbit 
globin gene (GenBank Accession No. V00882) used as an internal standard, oligo DNA primers for PGR amplification 
represented by SEQ ID NOS:7010 to 7059 targeting the nucleotide sequences of the genes were synthesized in a 
usual manner. 

[0405] As the olrgo DNA primers used for the PGR, 

[0406] DNAs having the nucleotide sequence represented by SEQ ID NOS:7010 and 7011 were used for the ampli- 
fication of the DNA having the nucleotide sequence represented by SEQ ID NO:207, 

[0407] DNAs having the nucleotide sequence represented by SEQ ID NOS:7012 and 7013 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3433, ~ . 
[0408] DNAs having the nucleotide sequence represented by SEQ ID NOS:7014 and 7015 were used for the am- 
plification of the DNA haying the nucleotide sequence represented by SEQ ID NO:281, 

[0409] DNAs having the nucleotide sequence represented by SEQ ID NOS:7016 and 7017 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3435, 

[0410] DNAs having the nucleotide sequence represented by SEQ ID NOS:7018 and 7019 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3439, 

[0411] DNAs having the nucleotide sequence represented by SEQ ID NOS:7020 and 7021 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:765, 

[0412] DNAs having the nucleotide sequence represented by SEQ ID NOS:7022 and 7023 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3445, 

[0413] DNAs having the nucleotide sequence represented by SEQ ID NOS:7024 and 7025 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO: 1226, 

[0414] DNAs having the nucleotide sequence represented by SEQ ID NOS:7026 and 7027 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:1229. 

[0415] DNAs having the nucleotide sequence represented by SEQ ID NOS:7028 and 7029 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3448, 

[0416] DNAs having the nucleotide sequence represented by SEQ ID NOS:7030 and 7031 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3451 , 

[0417] DNAs having the nucleotide sequence represented by SEQ ID NOS:7032 and 7033 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3453, 

[0418] DNAs having the nucleotide sequence represented by SEQ ID NOS:7034 and 7035 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3455, 

[0419] DNAs having the nucleotide sequence represented by SEQ ID NOS:7036 and 7037 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO: 1743, 

[0420] DNAs having the nucleotide sequence represented by SEQ ID NOS:7038 and 7039 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3470, 

[0421] DNAs having the nucleotide sequence represented by SEQ ID NOS:7040 and 7041 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:2132, 

[0422] DNAs having the nucleotide sequence represented by SEQ ID NOS:7042 and 7043 were uised for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3476, 

[0423] DNAs having the nucleotide sequence represented by SEQ ID NOS:7044 and 7045 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3477, 

[0424] DNAs having the nucleotide sequence represented by SEQ ID NOS:7046 and 7047 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3485, 

[0425] DNAs having the nucleotide sequence represented by SEQ ID NOS:7048 and 7049 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3488, 

[0426] DNAs having the nucleotide sequence represented by SEQ ID NOS:7050 and 7051 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3489. 

[0427] DNAs having the nucleotide sequence represented by SEQ ID NOS:7052 and 7053 were us d for the am- 
plification of the DNA having the nucleotide sequence repres nted by SEQ ID NO:3494, 

[0428] DNAs having the nucleotid sequence represent d by SEQ ID NOS:7054 and 7055 wer used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3496, 

[0429] DNAs having the nucleotid s quence repr sented by SEQ ID NOS:7056 and 7057 wer used for the am- 
plification of the DNA having the nucleotide sequenc represented by SEQ ID NO:3497. and 
[0430] DNAs having the nucleotid sequence represented by SEQ ID NOS:7058 and 7059 were used for th am- 
plification of the DNA having the nucleotide sequence of the rabbit globin gen , ' 
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as the respective primer set. 

[0431] The PGR was carried for 30 cycles with ach cycle consisting of 1 5 seconds at 95'*C and 3 minutes at 68**C 
using a thennal cycler (GeneAmp PGR system 9600, manufactured by P rtdn Elm r). TaKaRa EX-Taq (manufactured 
by Takara Shuzo), 100 ng of the chromosomal DNA and the buffer attached to the TaKaRa Ex-Taq reagent. In the case 
of the rabbit globin gene, a single-stranded cDNA which had been synthesized from rabbit globin mRNA (manufactured 
by Life Technologies) according to the manufacture's instructions using a reverse transcriptase RAV-2 (manufactured 
by Takara Shuzo). The PGR product of each gene thus amplified was subjected to agarose gel electrophoresis and 
extracted and purffted using QIAquick Gel Extraction Kit (manufactured by QIAGEN). The purified PGR product was 
concentrated by precipitating it with ethanol and adjusted to a concentration of 200 ng/fil. Each PGR product was 
spotted on a slide glass plate (manufactured by Matsunami Glass) having MAS coating in 2 runs using GTMASS 
SYSTEM (manufactured by Nippon Laser & Electronics L^.) according to the manufacture's instructions. 

(2) Synthesis of fluorescence labeled cDNA 

[0432] The ATGG 13032 strain was spread on BY agar medium (medium prepared by adding 20 g of peptone (man- 
ufactured by Kyokuto Pharmaceutical). 5 g of yeast extract (manufactured by Difco). and 16 g of Bactoagar (manufac- 
tured by Difco) to in 1 liter of water and adjusting its pH to 7.2) and cultured at SO^'C for 2 days. Then, the cultured 
strain was further inoculated into 5 ml of BY liquid medium and cultured at 30**G overnight. Then, the cultured strain 
was further inoculated into 30 ml of a minimum medium (medium prepared by adding 5 g of ammonium sulfate, 5 g of 
urea, 0.5 g of monopotassium dihydrogenphosphate, 0.5 g of dipotassiurii monohydrogenphosphate, 20.9 g of mor- 
pholinopropanesulfonic acid, 0.25 g of magnesium sulfate heptahydrate, 10 mg of calcium chloride dihydrate, 10 mg 
of manganese sulfate monohydrate, 10 mg of fen^ous sulfate heptahydrate, 1 mg of zinc sulfate heptahydrate, 0.2 mg 
copper sulfate, and 0.2 mg biotin to 1 liter of water, and adjusting its pH to 6.5) containing 110 mmol/l glucose or 200 
mmol/l ammonium acetate, and cultured in an Erienmyer flask at 30"" to give 1 .0 of absori[)ance at 660 nm. After the 
cells were prepared by centrifuging at 4*G and 5,000 rpm for 10 minutes, total RNA was prepared from the resulting 
cells according to the method of Bonnann etaL ( Molecular Microbiology, e. 317-326 (1992)). To avoid contamination 
with DNA, the RNA was treated with Dnasel (manufactured by Takara Shuzo) at 37**C for 30 minutes and then further 
purified using QIagen RNeasy MiniKit (manufactured by QIAGEN) according to the manufacture's instructions. To 30 
]ig of the resulting total RNA, 0.6 \i\ of rabbit globin mRNA (50 ng/^il, manufactured by Life Technologies) and 1 ^1 of 
a random 6 mer primer (500 ng/jil, manufactured by Takara Shuzo) were added for denaturing at 65**G for 1 0 minutes, 
followed by quenching on ice. To the resulting solution, 6 [il of a buffer attached to Superscript II (manufactured by 
Lifetechnologies), 3 ^1 of 0.1 mol/l DTT. 1.5 jil of dNTPs (25 mmol/l dATP, 25 mmol/I dGTP, 25 mmol/l dGTP. 10 mmol/ 
I dTTP), 1 .5 ^1 of Gy5-dUTP or Gy3-dUTP (manufactured by NEN) and 2 [il of Superscript II were added, and allowed 
to stand at 25''G for 10 minutes and then at 42''G for 110 minutes. The RNA extracted from the cells using glucose as 
the carbon source and the RNA extracted from the cells using ammonium acetate were labeled with Gy5-dUTP and 
Cy3-dUTP, respectively. After the fluorescence labeling reaction, the RNA was digested by adding 1.5 ^ll of 1 mol/l 
sodium hydroxide-20 mmol/l EDTA solution and 3.0 ^ll of 10% SDS solution, and allowed to stand at 65*'G for 10 
minutes. The two cDNA solutions after the labeling were mixed and purified using Qiagen PGR purification Kit (man- 
ufactured by QIAGEN) according to the manufacture's instructions to give a volume of 10^1. 

(3) Hybridization 

[0433] UltraHyb (11 0 ^1) (manufactured by Ambion) and the fluorescence-labeled cDNA solution (1 0 ^l) were mixed 
and subjected to hybridization and the subsequent washing of slide glass using GeneTAG Hybridization Station (man- 
ufactured by Genomic Solutions) according to the manufacture's instructions. The hybridization was carried out at 
50''G, and the washing was canied out at 25°G. 

(4) Ruorescence analysis 

[0434] The fluorescence amount of each DNA array having the fluorescent cDNA hybridized therewith was measured 
using ScanArray 4000 (manufactured by GSI Lumonics). 

[0435] Table 5 shows the Gy3 and Gy5 signal Intensities of the genes having been corrected on the basis of th data 
of the rabbit globin used as the int mal standard and the Gy3/Gy5 ratios. 



TabI 5 



SEQ ID NO 


Cy3int nsity 


CySint nsity 


Cy3/Cy5 


207 


5248 


3240 


1.62 
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Tables (continued) 



SEQ ID NO 


C\/3 inten^itv 


OmR intAnQrtv 
vyy sj iiiioiioiiy 




0*rOO 






U.OO 


£.0 1 






rt Q1 






9^1 


i AO 


0*r057 


009/ 




n Qi 

U.Ol 












1 IDs 




u.y 1 




imi 




U,o/ 


190Q 


1 1DO 


11 Q1 


i AO 

l.Uo 




11ft7 


1 RQA 


A 7 A 


OA 1^1 
0*f01 


c,OHO 




A 7>l 
U. /H 






i7ni; 
1 / uo 


O AK 




iAQi 


im*t 


1 OA 


1 f HO 


1079 


ICMi 


i A7 






Q7ftA 
O/D*!' 


1 OA 




117*5 


lUOO 


1 AQ 

l.Uo 








i OA 

i.oU 


3477 


1284 


1164 


1.10 


3486 


4539 


8014 


0.57 


3488 


34289 


1398 


24.52 


3489 


43645 


1497 


29.16 


3494 


3199 


2503 


1.28 


3496 


3428 


2364 


1.45 


3497 


3848 


3358 


1.15 



[0436] The ORF function data estimated by using software were searched for SEQ ID NOS:3488 and 3489 showing 
remarkably strong Cy3 signals. As a result, it was found that SEQ ID NOS:3488 and 3489 are a maleate synthas 
gene and an isocitrate lyase gene, respectively. It is Icnown that these genes are transcriptionally induced by acetic 
acid in Corynebactenum glutamicum {Archives of Microbiology, 168: 262-269 (1997)). 

[0437] As described above, a gene of which expression is fluctuates could be discovered by synthesizing appropriate 
oligo DNA primers based on the ORF nucleotide sequence infomnation deduced from the full genomic nucleotide 
sequence infomation of Corynebacterium glutamicum AJOC 1 3032 using software, amplifying the nucleotide sequenc- 
es of the gene using the genome DNA of Coryriebacterium glutamicum as a template in the PGR reaction, and thus 
producing and using a DNA microarray. 

[0438] This Example shows that the expression amount can be analyzed using a DNA microarray in the 24 genes. 
On the other hand, the present DNA microarray techniques make it possible to prepare DNA microanrays having thereon 
several thousand gene probes at once. Accordingly, it is also possible to prepare DNA microarrays having thereon all 
of the ORF gene probes deduced from the full genomic nucleotide sequence of Corynebactenum glutamicum ATCC 
13032 determined by the present invention, and analyze the expression profile at the total gene level of Corynebac- 
tenum glutamicum using these arrays. 

Example 5 

Homology search using Corynebactenum glutamicum genome sequence 
(1) Search of adenosine deaminase 

[0439] The amino acid sequence (ADD^ECOLI) of Escherichia co// adenosin deaminase was obtained from Swiss- 
prot Database as the amino acid sequence of the protein of which function had been conf imied as adenosine deaminase 
(EC3.5.4^). By using the full I ngth of this amino acid sequenc as a query, a homology s arch was canied out on a 
nucleotide sequence database of the genome sequence of Corynebactenum glutamicum or a databas of the amino 
acids in the ORF region deduced from the genome sequ nee using FASTA program (Proc. Natl. Acad. ScL ISA, 85: 
2444-2448 (1 988)). A case wh re E-valu was le-^o or less was judged as being significantly homologous. As a result, 
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no sequence significantty homologous with the Escherichia coli adenosine deaminase was found in the nucleotide 
sequence database of the genome sequenc of Corynebacterium glutamicum or the database of th amino acid se- 
quences in th ORF region d duced from th genom sequence. Bas d on these results, it Is assum d that Coryne- 
bacterium glutamicum contains no ORF having adenosine deaminase activity and thus has no activity of converting 
adenosine Into inosine. 

(2) Search of glycine cleavage enzyme 

[0440] The sequences (GCSP_ECOLI, GCST.ECOLI and GCSH_ECOLI) of glycine decarboxylase, aminomethyl 
transferase and an aminomethyl group carrier each of which is a component of Escherichia coli glycine cleavage 
enzyme as the amino acid sequence of the protein, of which function had been conftmned as glycine cleavage enzyme 
(EC2.1 .2,1 0), were obtained from Swiss-prot Database. 

[0441 ] By using these full-length amino acid sequences as a query, a homology search was carried out on a nucleotide 
sequence database of the genome sequence of Corynebacterium glutamicum or a database of the ORF amino acid 
sequences deduced from the genome sequence using FASTA program. A case where E-value was le-^^ or less was 
judged as being significantly homologous. As a result, no sequence significantly homologous with the glycine decar- 
boxylase, the aminomethyl transferase or the aminomethyl group carrier each of which Is a component of Escherichia 
coli glycine cleavage enzyme, was found in the nucleotide sequence database of the genome sequence of Coryne- 
bacterium glutamicum or the database of the ORF amino acid sequences estimated from the genome sequence. Based 
on these results, it is assumed that Corynebacterium glutamicum contains no ORF having the activity of glycine de- 
cartK)xylase, aminomethyl transferase or the aminomethyl group earner and thus has no activity of the glycine cleavage 
enzyme. 

(3) Search of IMP dehydrogenase 

[0442] The amino acid sequence (IMDH ECOLI) of Escherichia co// IMP dehydrogenase as the amino acid sequence 
of the protein, of which function had been confirmed as IMP dehydrogenase (EC1 .1 .1 .205), was obtained from Swiss- 
prot Database. By using the full length of this amino acid sequence as a query, a homology search was carried out on 
a nucleotide sequence database of the genome sequence of Corynebacterium glutamicum or a database of the ORF 
amino acid sequences predicted from the genome sequence using FASTA program. A case where E-value was le-^o 
or less was judged as being significantly homologous. As a result, the amino acid sequences encoded by two ORFs, 
namely, an ORF positioned In the region of the nucleotide sequence No. 61 5336 to 61 6853 (or ORF having the nucle- 
otide sequence represented by SEQ ID NO:672) and another ORF positioned in the region of the nucleotide sequence 
No. 616973 to 618094 (or ORF having the nucleotide sequence represented by SEQ ID NO:674) were significantly 
homologous with the ORFs of Escherichia coli IMP dehydrogenase. By using the above-described predicted amino 
acid sequence as a query in order to examine the similarity of the amino acid sequences encoded by the ORFs with 
IMP dehydrogenases of other organisms in greater detail, a search was carried out on GenBank (http://www.ncbi.nlm. 
nih.gov/) nr-aa database (amino acid sequence database constructed on the basis of GenBankCDS translation prod- 
ucts, PDB database, Swiss-Prot database. PIR database, PRF database by eliminating duplicated registrations) using 
BLAST program. As a result, both of the twp amino acid sequences showed significant homologies with IMP dehdy- 
rogenases of other organisms and cleariy higher homologies with IMP dehdyrogenases than with amino acid sequences 
of other proteins, and thus, it was assumed that the two ORFs would function as IMP dehydrogenase. Based on thes 
results, it was therefore assumed that Corynebacterium glutamicum has two ORFs having the IMP dehydrogenase 
activity. 

Example 6 

Proteome analysis of proteins derived from Corynebacterium glutamicum 

(1) Preparations of proteins derived from Corynebacterium glutamicum AJC0 13032, PERM BP-7134 and PERM BP- 
158 

[0443] Culturing tests of Corynebacterium glutamicum ATCC 1 3032 (wild type strain). Corynebacterium glutamicum 
PERM BP-7134 (lysine-producing strain) and Corynebacterium glutamicum (PERM BP-158. lysine-highly producing 
strain) wer caniedoutina5ljarfenn nter according to the method In ExampI 2(3). The results are shown in TabI 6. 
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Tables 



Strain 


L-Lysin yield (g/I) 


ATCC 13032 


0 


PERM BP-7134 


45 


PERM BP-158 


60 



[0444] After culturing, cells of each strain were recovered by centrifugation. These cells were washed with Tris-HCI 
buffer (1 0 mmol/l Tris-HCI, pH 6.5. 1 .6 mg/ml protease inhibitor (COMPLETE; manufactured by Boehringer Mannheim)) 
three times to give washed cells which could be stored under freezing at -80''C. The freeze*stored cells were thawed 
before use, and used as washed cells. 

[0445] The washed ceils described above were suspended in a disruption buffer (1 0 mmol/l Tris-HCI, pH 7.4, 5 mmol/ 
I magnesium chloride. 50 mg/l RNase, 1.6 mg/ml protease inhibitor (COMPLETE: manufactured by Boehringer Man- 
nheim)), and disrupted with a disrupter (manufactured by Brown) under cooling. To the resulting disruption solution. 
DNase was added to give a concentration of 50 mg/l, and allowed to stand on ice for 10 minutes. The solution was 
centrifuged (5,000 x g, 15 minutes, 4°C) to remove the undisrupted cells as the precipitate, and the supernatant was 
recovered. 

[0446] To the supernatant, urea was added to give a concentration of 9 mM, and an equivalent amount of a lysis 
buffer (9.5 mol/l urea, 2% NP-40, 2% Ampholine, 5% mercaptoethanol, 1 .6 mg/ml protease inhibitor (COMPLETE; 
manufactured by Boehringer Mannheim) was added thereto, followed by thoroughly stirring at room temperature for 
dissolving. 

[0447] After being dissolved, the solution was centrifuged at 12,000 x g for 15 minutes, and the supernatant was 
recovered. 

[0448] To the supematant, ammonium sulfate was added to the extent of 80% saturation, followed by thoroughly 
stirring for dissolving. 

[0449] After being dissolved, the solution was centrifuged (16,000 x g, 20 minutes, 4**C), and the precipitate was 
recovered. This precipitate was dissoh/ed in the lysis buffer again and used in the subsequent procedures as a protein 
sample. The protein concentration of this sample was determined by the method for quantifying protein of Bradford. 

(2) Separation of protein by two dimensional electrophoresis 

[0450] The first dimensional electrophoresis was carried out as described below by the isoelectric electrophoresis 
method. 

[0451] A molded dry IPG strip gel (pH 4-7, 13 cm, Immobiline DryStrips; manufactured by Amersham Pharmacia 
B|otech).was set in an electrophoretic apparatus (Multiphor M or IPGphor; manufactured by Amersham Phannacia 
Biotech) and a swelling solution (8 mol/l urea, 0.5% Triton X-100, 0.6% dithiothreitol, 0.5% Amphotine, pH 3-10) was 
packed therein, and the gel was allowed to stand for swelling 12 to 16 hours. 

[0452] The proteiri sample prepared above was dissolved in a sample solution (9 mol/l urea, 2% CHAPS, 1% dithi- 
othreitol, 2% Ampholine, pH 3-10). and then about 100 to 500 ^ig (In tenns of protein) portions thereof were taken and 
added to the swollen IPG strip gel. 

[0453] The electrophoresis was earned out in the 4 steps as defined below under controlling the temperature to 20** C: 

step 1 : 1 hour under a gradient mode of 0 to 500V; 
step 2: 1 hour under a gradient mode of 500 to 1 ,000 V; 
step 3: 4 hours under a gradient mode of 1 ,000 to 8,000 V; and 
step 4: 1 hour at a constant voltage of 8,000 V. 

[0454] After the isoelectric electrophoresis, the IPG strip gel was put off from the holder and soaked in an equilibration 
buffer A (50 mmol/l Tris-HCI, pH 6.8, 30% glycerol, 1% SDS, 0.25% dithiothreitol) for 15 minutes and another equili- 
bration buffer B (50 mmol/l Tris-HCI, pH 6.8, 6 mol/l urea, 30% glycerol, 1 % SDS, 0.45% iodo acetamide) for 1 5 minutes 
to sufficiently equilibrat the gel. 

[0455] After the equilibrium, the IPG strip gel was lightly rins d in an SDS lectrophoresis buffer (1.4% glycine, 0.1% 
SDS, 0.3% Tris-HCI. pH 8.5), and th second dimensional electrophoresis depending on molecular weight was carried 
out as described below to separate th proteins. 

[0456] Specifically, the abov IPG strip gel was closely placed on 1 4% potyacrylamtde slub gel (1 4% potyacrylamide, 
0.37% bisacrylamide, 37.5 mmol/l Tris-HCI. pH 8.8, 0.1% SDS, 0.1% TEMED. 0.1% ammonium persulfate) and sub- 
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jected to electrophoresis under a constant voltage of 30 mA at 20°C for 3 hours to separate the proteins. 

(3) D tectlon of protein spot 

5 [0457] Coomassie staining was p rformed by th method of Gorg et al. {Electrophoresis, 9: 531 -546 (1 988)) for the 
slub gel after the second dimensional electrophoresis. Specifically, the slub gel was stained under shaking at 25°C for 
about 3 hours, the excessive coloration was removed with a decoloring solution, and the gel was thoroughly washed 
with distilled water. 

[0458] The results are shown in Rg. 2. The proteins derived from the ATCC 1 3032 strain (Rg. 2A). PERM BP-71 34 
10 strain (Rg. 2B) and PERM BP-158 strain (Pig. 2C) could be separated and detected as spots. 

(4) In-gel digestion of detected protein spot 

[0459] The detected spots were each cut out from the gel and transferred into siliconized tube, and 400 of 100 
15 mmol/1 ammonium bicarbonate : acetonitrile solution (1:1, v/v) was added thereto, followed by shaking overnight and 
f reeze-dried as such, to the dried gel, 1 0 ^ of a lysylendopeptidase (LysC) solution (manufactured by WAKO, prepared 
with 0.1% SDS-containing 50 mmol/l ammonium bicarbonate to give a concentration of 100 ngf\x\) was added and th 
gel was allowed to stand for swelling at O'^C for 45 minutes, and then allowed to stand at 37''C for 16 hours. After 
removing the LysC solution, 20 ^1 of an extracting solution (a mixture of 60% acetonitrile and 5% fonnic acid) was 
20 added, followed by ultrasonication at room temperature for 5 minutes to disrupt the gel. After the disruption, the extract 
was recovered by centrifugation (12.000 rpm, 5 minutes, room temperature). This operation was repeated twice to 
recover the whole extract. The recovered extract was concentrated by centrifugation in vacuo to halve the liquid volume. 
To the concentrate, 20 ^1 of 0.1% trifluoroacetic acid was added, followed by thoroughly stining, and the mixture was 
subjected to desalting using ZIpTip (manufactured by Millipore). The protein absoribed on the carriers of ZipTIp was 
25 eluted with 5 |xl of a-cyano-4-hydroxycinnamic acid for use as a sample solution for analysis. 

(5) Mass spectrometry and amino acid sequence analysis of protein spot with matrix assisted laser desorption ionization 
time of flight mass spectrometer (MALDI-TOPMS) 

30 [0460] The sample solution for analysis was mixed in the equivalent amount with a solution of a peptide mixture for 
mass calibration (300 nmol/l Angiotensin II, 300 nmoI/1 Neurotensin, 150 nmol/l ACTHcllp 18-39, 2.3 fimol/l bovine 
Insulin B chain), and 1 ^1 of the obtained solution was spotted on a stainless probe and crystallized by spontaneously 
drying. 

[0461] As measurement instruments, REPLEX MALDI-TOP mass spectrometer (manufactured by Bruker) and an 
35 N2 laser (337 nm) were used in combination. 

[0462] The analysis by PMP (peptide-mass finger printing) was carried out using integration spectra data obtained 
by measuring 30 times at an accelerated voltage of 1 9.0 kV and a detector voltage of 1 .50 kV under reflector mode 
conditions. Mass calibration was canied out by the ihtemal standard method. 

[0463] The PSD (post-source decay) analysis was carried out using integration spectra obtained by successiv ly 
^ altering the reflection voltage and the detector voltage at an accelerated voltage of 27.5 kV. 

[0464] The masses and amino acid sequences of the peptide fragments derived from the protein spot after digestion 
were thus determined. 

(6) Identification of protein spot 

45 

[0465] Prom the amino acid sequence information of the digested peptide fragments derived from the protein spot 
obtained in the above (5), ORPs conresponding to the protein were searched on the genome sequence database of 
Corynebacterium glutamicum ATCC 13032 as constructed in Example 1 to identify the protein. 
[0466] The identification of the protein was earned out using MS-Rt program and MS-Tag program of intranet protein 
50 prospector. 

(a) Search and identification of gene encoding high- xpression protein 

[0467] In the prot ins derived from Corynebacterium giutamicum ATCC 1 3032 showing high expr ssion amounts in 
55 CBB-staining shown In Rg. 2A, the proteins con^sponding to Spots-1 ,2,3,4 and 5 were identified by the above method. 
[0468] As a result, it was found that Spot-1 corresponded to nolase which was a protein having the amino acid 
sequence of SEQ ID NO:4585;Spot-2 corresponded to phosphoglyc lat kinas which was a protein having the amino 
acid sequenc of SEQ ID NO:5254; Spot-3 corresponded to glyceraldehyde-3-phosphate dehydrogenase which was 
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a protein having the amino acid sequence represented by SEQ ID NO:5255; Spot-4 000* sponded to fructos bis- 
phosphate aldolase which was a protein having the amino acid sequence represented by SEQ 10 NO:6543; and Sppt- 
5 conresponded to triose phosphate Isomerase which was a protein having the amino acid sequence represented by 
SEQ ID NO:5252. 

5 [0469] These genes, represented by SEQ ID NOS:1085, 1754, 1776. 3043 and 1752 encoding the proteins con^e- 
sponding to Spots-1, 2, 3, 4 and 5, respectively, encoding the known proteins are important in the central metabolic 
pathway for maintaining the life of the microorganism. Particularly, it is suggested that the genes of Spots-2, 3 and 5 
fomn an operon and a high-expression promoter is encoded in the upstream thereof {J. ofEacterloL, 174: 6067-6086 
(1992)). 

10 [0470] Also, the protein conresponding to Spot-9 in Rg. 2 was identified in the same manner as described abov , 
and it was found that Spot-9 was an elongation factor Tu which was a protein having the amino acid sequence repre- 
sented by SEQ ID No:6937, and that the protein was encoded by DNA having the nucleotide sequence represented 
by SEQ ID No:3437. 

[0471] Based on these results, the proteins having high expression level were identified by proteome analysis using 
15 the genome sequence database of Corynebacterium glutamicum constructed In Example 1 . Thus, the nucleotide s - 
quences of the genes encoding the proteins and the nucleotide sequences upstream thereof could be searched simul- 
taneously. Accordingly, it is shown that nucleotide sequences having a function as a high-expression promoter can be 
efficiently selected. 

20 (b) Search and identification of modified protein 

[0472] Among the proteins derived from Corynebacterium glutamicum PERM BP-7134 shown in Fig. 28, Spots-6, 
7 and 8 were identified by the above method. As a result, these three spots all con-esponded to catalase which was a 
protein having the amino acid sequence represented by SEQ ID NO:3785. 
25 [0473] Accordingly, all of Spots-6. 7 and 8 detected as spots differing in isoelectric mobility were all products derived 
from a catalase gene having the nucleotide sequence represented by SEQ ID No:285. Accordingly, it is shown that 
the catalase derived from Corynebacterium glutamicurti PERM BP-7134 was modified after the translation. 
[0474] Based on these results, it is confirmed that various modified proteins can be efficiently searched by proteome 
analysis using the genome sequence database of Corynebacterium y/ufam/cum constructed in Example 1. 

30 

(c) Search and identification of expressed protein effective in lysine production 

[0475] It was found out that in Rg. 2A (ATCC 13032: wild type strain). Rg. 2B (PERM BP-7134: lysine-producing 
strain) and Fig. 2C (PERM BP-158: lysine-hlghly producing strain), the catalase corresponding to Spot-8 and the elon- 
35 gation factor Tu corresponding to Spot-9 as identified above showed the higher expression level with an increase in 
the lysine productivity. 

[0476] Based on these results, it was found that hopeful mutated proteins can be efficiently searched and identified 
in breeding aiming at strengthening the productivity of a target product by the proteome analysis using the genome 
sequence database of Corynebacterium glutamicum constructed in Example 1. 

[0477] Moreover, useful mutation points of useful mutants can be easily specified by searching the nucleotide se- 
quences (nucleotide sequences of promoter, ORP, or the like) relating to the identified proteins using the above data- 
base and using primers designed on the basis of the sequences. As a result of the fact that the mutation points are 
specified, industrially useful mutants which have the useful mutations or other useful mutations derived herefrom can 
be easily bred. 

45 [0478] While the invention has been described in detail and with reference to specific embodiments thereof, it will 
be apparent to one of skill in the art that various changes and modifications can be made therein without departing 
from the spirit and scope thereof. All references cited herein are incorporated in their entirety 



50 Claims 

1- A method for at least one of the following: 

(A). identifying a mutation point of a g ne derived from a mutant of a corynefomn bacterium. 
55 (B) measuring an xpresston amount of a gene derived from a corynefomi bacterium, 

(C) analyzing an xpression proftl of a gene derived from a coryneform bact rium, 

(D) analyzing expression pattems of genes derived from a corynefomri bacterium, or 

(E) identifying a gen homologous to a gene deriv d from a corynefomi bacterium. 
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said method comprising: 

(a) producing a polynuci otid array by adhering to a solid support at least two polynucleotides selected 
from the group consisting of first polynucleotides comprising the nucleotide sequence represented by any . 
one of SEQ ID N0S:1 to 3501 , second polynucleotides which hybridize with the first polynucleotides under 
stringent conditions, and third polynucleotides comprising a sequence of 10 to 200 continuous bases of 
the first or second polynucleotides, 

(b) incubating the polynucleotide array with at least one of a labeled polynucleotide derived from a co- 
ryneform bacterium, a labeled polynucleotide derived from a mutant of the coryneform bacterium or a 
labeled polynucleotide to be examined, under hybridization conditions, 

(c) detecting any hybridization, and 

(d) analyzing the result of the hybridization. 

2. The method according to claim 1, wherein the coryneform bacterium is a microorganism belonging to the genus 
Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

3. The method according to claim 2, wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilumt Corynebacterium 
acetoglutamicum, Corynebacterium callunae, Corynebacterium hercuiis, Corynebacterium lilium, Corynebacteri- 
um melassecola, Corynebacterium tfiermoaminogenes, and Corynebacterium ammoniagenes. 

4. The method according to claim 1 , wherein the polynucleotide derived from a coryneform bacterium, the polynuce- 
lotide derived from a mutant of the corynefomn bacterium or the polynucleotide to be examined is a gene relating 
to the biosynthesis of at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharid , 
an organic acid, and analogues thereof. 

5. The method according to claim 1 , wherein the polynucleotide to be examined is derived from Escherichia coii, 

6- A polynucleotide array, comprising: 

at least two polynucleotides selected from the group consisting of first polynucleotides comprising the nucle- 
otide sequence represented by any one of SEQ ID N0S:1 to 3501, second polynucleotides which hybridize 
with the first polynucleotides under stringent conditions, and third polynucleotides comprising 10 to 200 con- 
tinuous bases of the first or second polynucleotides, and 
a solid support adhered thereto. 

7. A polynucleotide comprising the nucleotide sequence represented by SEQ ID N0:1 or a polynucleotide having a 
homology of at least 80% with the polynucleotide. 

8. A polynucleotide comprising any one of the nucleotide sequences represented by SEQ ID NGS:2 to 3431 , or a 
polynucleotide which hybridizes with the polynucleotide under stringent conditions. 

9. A polynucleotide encoding a polypeptide having any one of the amino acid sequences represented by SEQ ID 
NOS:3502 to 6931, or a polynucleotide which hybridizes therewith under stringent conditions. 

10. A polynucleotide which is present in the 5* upstream or 3' downstream of a polynucleotide comprising the nucleotide 
sequence of any one of SEQ ID N0S:2 to 3431 in a whole polynucleotide comprising the nucleotide sequence 
represented by SEQ ID N0:1, and has an activity of regulating an expression of the polynucleotide. 

11. A polynucleotide comprising 10 to 200 continuous bases in the nucleotide sequence of the polynucleotide of any 
one of claims 7 to 1 0, or a polynucleotid comprising a nucleotide sequence complementary to the polynucleotide 
comprising 10 to 200 continuous based. 

12. A recombinant DNA comprising the polynucleotide of any on of claims 8 to 11 . 

13. A transfonnant comprising the polynucleotide of any on of claims 8 to 11 or the recombinant DNA of claim 12. 

14. A method for producing a potypeptid , comprising: 
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culturing the transformant of claim 13 in a m dium to produce and accumulat a potypeptid encoded by the 
polynucleotid of claim 8 or 9 in the medium, and 
recovering the polypeptide from the medium; 

1 5. A method for producing at least one of an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and 
analogues thereof, comprising: 

culturing the transformant of claim 13 in a medium to produce and accumulate at least one of an amino acid, 
a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof in the medium, and 
recovering the at least one of the amino acid, the nucleic acid, the vitamin, the saccharide, the organic acid, 
and analogues thereof from the medium. 

16. A polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from SEQ ID N0S:2 to 
3431. 

17. A polypeptide comprising the amino acid sequence selected from SEQ ID NOS:3502 to 6931. 

18. The polypeptide according to claim 16 or 17, wherein at least one amino acid is deleted, replaced, inserted or 
added, said polypeptides having an activity which is substantially the same as that of the polypeptide without said 
at least one amino acid deletion, replacement, insertion or addition. 

1 9- A polypeptide comprising an amino acid sequence having a homology of at least 60% with the amino acid sequenc 
of the polypeptide of claim 1 6 or 1 7, and having an activity which is substantially the same as that of the polypeptide. 

20. An antibody which recognizes the polypeptide of any one of claims 16 to 19. 

21. A polypeptide array, comprising: 

at least one polypeptide or partial fragment polypeptide selected from the polypeptides of claims 1 6 to 1 9 and 
partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

22. A polypeptide array, comprising: 

at least one antibody which recognizes a polypeptide or partial fragment polypeptide selected from the polypep- 
tides of claims 16 to 19 and partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

23. A system based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
fOHT) bacterium, comprising the following: 

(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID N0S:1 
to 3501, and target sequence or target stmcture motif infonmation; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 
1 to 3501 with the target sequence or target structure motif information, recorded by the data storage device 
for screening and analyzing nucleotide sequence information which is coincident with or analogous to the 
target sequence or target structure motif infomiation; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

24. A method bas donacomput r for identifying a targets quence or a target structure motif derived from a coryne- 
fomi bacterium, comprising the following: 

(il inputting at least on nucleotide sequence infomiation selected from SEQ ID N0S:1 to 3501, target se- 
quence information or target structure motif information into a user input device; 

(ii) at least temporarily storing said information; 

(iii) comparing th at least one nucleotide sequence information s lected from SEQ ID N0S:1 to 3501 with 
the target sequence or target structure motif information; and 
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(iv) screening and analyzing nucleotide sequence infonnation which is coincident with or analogous to the 
targets quence or target structur motif information. 

25. A system based on a comput r for identifying a target sequence or a target structure motif deriv d from a coryne- 
form bacterium, comprising the following: 

(i) a user input device that inputs at least one amino acid sequence infomiation selected from SEQ ID NOS: 
3502 to 7001 , and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input infomnation; 

(iii) a comparator that compares the at least one amino acid sequence infomnation selected from SEQ ID NOS: 
3502 to 7001 with the target sequence or target structure motif information, recorded by the data storage 
device for screening and analyzing amino acid sequence infomnation which is coincident with or analogous to 
the target sequence or target structure motif infomnation; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

26. A method based on a computer for Identifying a target sequence or a target stmcture motif derived from a coryne- 
form bacterium, comprising the following: 

(i) inputting at least one amino acid sequence infomnation selected from SEQ ID NOS:3502 to 7001 , and target 
sequence infomnation or target structure motif infomnation Into a user input device; 
(il) at least temporarily storing said Infomnation; 

(iii) comparing the at least one amino acid sequence infomnation selected from SEQ ID NOS:3502 to 7001 
with the target sequence or target stmcture motif infomnation; and 

(iv) screening and analyzing amino acid sequence infomnation which is coincident with or analogous to the 
target sequence or target structure motif information. 

27. A system based on a computer for determining a function of a polypeptide encoded by a polynucleotide having a 
target nucleotide sequence derived from a corynefomn bacterium, comprising the following: 

(i) a user input device that Inputs at least one nucleotide sequence information selected from SEQ ID NOS:2 
to 3501, function infomnation of a polypeptide encoded by the nucleotide sequence, and target nucleotide 
sequence information; 

(il) a data storage device for at least temporarily storing the input Infomnation; 

(iii) a comparator that compares the at least one nucleotide sequence infomnation selected from SEQ ID NOS: 
2 to 3501 with the target nucleotide sequence information for detemnining a function of a polypeptide encoded 
by a polynucleotide having the target nucleotide sequence which Is coincident with or analogous to the poly- 
nucleotide having at least one nucleotide sequence selected from SEQ ID NOS:2 to 3501 ; and 

(iv) an output devices that shows a function obtained by the comparator. 

28. A method based on a computer for detemnining a function of a polypeptide encoded by a polypeptide encoded by 
a polynucleotide having a target nucleotide sequence derived from a coryneform bacterium, qdmprising the fol- 
lowing: 

(i) inputting at least one nucleotide sequence infomnation selected from SEQ ID NOS:2 to 3501, function in- 
fomnation of a polypeptide encoded by the nucleotide sequence, and target nucleotide sequence information; 
(il) at least temporarily storing said infomnation; 

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID N0S:2 to 3501 with 
the target nucleotide sequence infomnation; and 

(iv) determining a function of a polypeptide encoded by a polynucleotide having the target nucleotide sequence 
which is coincident with or analogous to the polynucleotide having at least one nucleotide sequence selected 
from SEQ ID N0S;2 to 3501 . 

29. A system based on a computer for determining a function of a polypeptide having a target amino add sequence 
deriv_d from a corynefomn bacterium, comprising the following: 

(I) a user input device that Inputs at least on amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , function Infomnation bas d on the amino acid s quence, and target amino acid sequence Infor- 
mation; 
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(ii) a data storing device for at least temporarily storing the Input information; 

(ili) a comparator that compar s the at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 with the target amino acid sequence information for determining a function of a polyp ptide 
having the target amino acid sequence which is coincident with or analogous to the polypeptide having at least 
one amino acid sequenc selected from SEQ ID NOS:3502 to 7001 ; and 
(iv) an output device that shows a function obtained by the compariator. 

30. A method based on a computer for detemiinlng a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

(i) inputting at least one amino acid sequence infonnation selected from SEQ ID NOS:3502 to 7001, function 
infomriation based on the amino acid sequence, and target amino acid sequence mformation; 

(ii) at least temporarily storing said infomriation; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 
with the target amino acid sequence infomriation; and 

(iv) detemilning a function of a polypeptide having the target amino acid sequence which is coincident with or 
analogous to the polypeptide having at least one amino acid sequence selected from SEQ ID NOS:3502 to 
7001. 

31 . The system according to any one of claims 23, 25, 27 and 29, wherein a corynefomi bacterium is a microorganism 
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

32. The method according to any one of claims 24, 26, 28 and 30, wherein a corynefonn bacterium is a microorganism 
of the genus Corynebacterium, the genus Brevibactenum, or the genus Microbacterium. 

33. The system according to claim 31 . wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium giutamicum, Corynebacterium acetoacidophiium, Corynebacterium 
acetoglutamicum, Corynebacterium caiiunae, Corynebacterium hercuiis, Corynebacterium lilium, Corynebacteri- 
um melassecola, Corynebacterium thennoaminogenes, and Corynebacterium ammoniagenes. 

34. The method according to claim 32, wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium giutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetogiutamicum, Corynebacterium cailunae, Corynebacterium herculis, Corynebacterium iiiium, Corynebacteri- 
um melassecota, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

35. A recording medium or storage device which is readable by a computer in which at least one nucleotide sequenc 
Information selected from SEQ ID N0S:1 to 3501 or function information based on the nucleotide sequence is 
recorded, and is usable in the system of claim 23 or 27 or tiie method of claim 24 or 28. 

36. A recording medium or storage device which is readable by a computer in which at least one amino acid sequenc 
information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid sequence 
is recorded, and is usable in tiie system of claim 25 or 29 or the method of claim 26 or 30. 

37. The recording medium or storage device according to claim 35 or 36, which is a computer readable recording 
medium selected from tf^e group consisting of a floppy disc, a hard disc, a magnetic tape, a random access memory 
(RAM), a read only memory (ROM), a magneto-optic disc (MO), CD-ROM, CD-R, CD-RW. DVD-ROM. DVD-RAM 
and DVDtRW. 

38. A polypeptide having a homoserine dehydrogenase activity, comprising an amino acid sequence in which the Vai 
residue at the 59th in the amino acid sequence of homoserine dehydrogenase derived from a coryneform bacterium 
is replaced with an amino acid residue other than a VaJ residue. 

39. A polypeptid comprising an amino acid sequence in which tiie Vai residue at the 59th position in th amino acid 
sequenc as represented by SEQ ID NO:6952 is replaced with an amino acid r sidu other than a Vat residue. 

40. The polypeptid according to claim 38 or 39, wher in the Vai residue at the 59th position is replac d with an Ala 
residue. 
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41. A polyp ptide having pyaivate carboxylase activity, comprising an amino acid sequ nee in whicli the Pro residu 
at the 458th position in the amino acid sequence of pyruvate carboxylase derived from a coryneform bacteriuo) is 
replaced with an amino acid residue other than a Pro residue. 

42- A polypeptide comprising an amino acid sequ nee in which the Pro residue at the 458th position in th amino acid 
sequence represented by SEQ ID NO:4265 is replaced with an amino acid residue other than a Pro residue. 

43. The polypeptide according to claim 41 or 42, wherein the Pro residue at the 458th position is replaced with a Ser 
residue. 

44. The polypeptide according to any one of claims 38 to 43, which Is derived from Corynebacterium glutamlcum, 

45. A DNA encoding the polypeptide of any one of claims 38 to 44. 

46. A recombinant DNA comprising the DNA of claim 45. 

47. A transformant comprising the recombinant DNA of claim 46. 

48. A transfomnant comprising in its chromosome the DNA of claim 45. 

49- The transformant according to claim 47 or 48, which is derived from a coryneform bacterium. 

50. The transformant according to claim 49, which is derived from Corynebacterium glutamicum. 

51. A method for producing L-tysine, comprising: 

culturing the transformant of any one of claims 47 to 50 in a medium to produce and accumulate L-lysine in 
the medium, and 

recovering the L-lysine from the culture. 

52. A method for breeding a corynefonm bacterium using the nucleotide sequence infonnation represented by SEQ 
ID NOS:1 to 3431 , comprising the following: 

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a corynefonm bacte- 
rium which has been subjected to mutation breeding so as to produce at least one compound selected from 
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 
method, with a con^esponding nucleotide sequence in SEQ ID N0S:1 to 3431; 

(ii) identifying a mutation point present in the production strain based on a result obtained by (i); 

(iii) introducing the mutation point into a corynefonn bacterium which is free of the mutation point, or deleting 
the mutation point from a corynefonn bacterium having the mutation point; and 

(iv) examining productivity by the femientation method of the compound selected in (i) of the coryneform 
bacterium obtained in (iii). 

53- The method according to clairh 52, wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
a signal transmission pathway. 

54- The method according to claim 52, wherein the mutation point is a mutation point relating to a useful mutation 
which Improves or stabilizes the productivity. 

55- A method for breading a corynefonn bacterium using the nucleotide sequence information represented by SEQ 
ID N0S:1 to 3431 , comprising: 

(i) comparing a nucleotide sequenc of a genome or gene of a production strain derived a corynefonn bacte- 
rium which has been subject d t mutation breeding so as to produce at least one compound selected from 
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous ther of by a f nnentation 
method, with a conresponding nucleotide sequ nee in SEQ ID N0S:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtain by (i); 

(iii) deleting a mutation point from a coryneform bacterium having the mutation point; and 
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(iv) examining productivity by the fermentation method of the compound select d in (i) of the corynefomn 
bacterium obtained in (iii). 

56. The method according to claim 55, wherein the g ne is a gene encoding an enzyme in a biosynth tic pathway or 
a signal transmission pathway. 

57. The method according to claim 55, wherein the mutation point is a mutation point which decreases or destabilizes 
the productivity. 

58. A method for breeding a corynefomi bacterium using the nucleotide sequence infomiation represented by SEQ 
ID N0S:2 to 3431, comprising the following: 

(i) identifying an isozyme relating to biosynthesis of at least one compound selected from an amino acid, a 
nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof, based on the nucleotide se- 

. quence infomnation represented by SEQ ID N0S:2 to 3431 ; 

(ii) classifying the isozyme identified in (i) into an isozyme having the same activity; 

(iii) mutating all genes encoding the isozyme having the same activity simultaneously; and 

(iv) examining productivity by a femientation method of the compound selected in (i) of the coryneform bac- 
terium which have been transfomied with the gene obtained in (iii). 

59. A method for breeding a corynefomi bacterium using the nucleotide sequence information represented by SEQ 
ID N0S:2 to 3431, comprising the following: 

(i) an-anging a function infomiation of an open reading frame (ORF) represented by SEQ ID NOS:2 to 3431; 

(ii) allowing the arranged ORF to con-espond to an enzyme on a known biosynthesis or signal transmission 
pathway; 

(iii) explicating an unknown biosynthesis pathway or signal transmission pathway of a corynefomi bacterium 
in combination with infomiation relating known biosynthesis pathway or signal transmission pathway of a co- 
ryneform bacterium; 

(iv) comparing the pathway explicated in (iii) with a biosynthesis pathway of a target useful product; and 

(v) transgenetically varying a corynefomi bacterium based on the nucleotide sequence infonmation to either 
strengthen a pathway which is judged to be important in the biosynthesis of the target useful product in (iv) or 
weaken a pathway which is judged not to be important in the biosynthesis of the target useful product in (iv). 

60. A corynefonm bacterium, bred by the method of any one of claims 52 to 59. 

61. The coryneform bacterium according to claim 60, which is a microorganism belonging to the genus Corynebacte- 
hum, the genus Brevibacterium, or the genus Microbacterium. 

62. The coryneform bacterium according to claim 61 , wherein the microorganism belonging to the genus Corynebac- 
terium is selected from the group consisting of Corynebacterium glutamicum, Corynebactenum acetoacidophilum, 
Corynebacterium acetoglutawicum, Corynebacterium callunae, Corynebacterium hercuUs, corynebacterium Iii- 
lum, Corynebacterium melassecola, Corynebacterium thermoamino genes, and Corynebacterium ammonia 
genes. 

63. A method for producing at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, 
an organic acid and an analogue thereof, comprising: 

culturing a coryneform bacterium of any one of claims 60 to 62 in a medium to produce and accumulate at 
least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, 
and analogues thereof; 
recovering the compound from the culture. 

64. The rnethod according to claim 63, wherein the compound is L-lysine. 

65. A method for identifying a protein relating to useful mutation based on proteom analysis, comprising the following: 

(i) preparing 
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a protein derived from a bacterium of a production strain of a coryneform bacterium which has been sub- 
jected to mutation breeding by a ferm ntation process so as to produce at least one compound selected 
from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogu s ther of, and 
a protein deriv d from a bacterium of a parent strain of the production strain; 

(ii) separating the proteins prepared in (i) by two dimensional electrophoresis; 

(iii) detecting the separated proteins, and comparing an expression amount of the protein derived from th 
production strain with that derived from the parent strain; 

(iv) treating the protein showing different expression amounts as a result of the comparison with a peptidase 
to extract peptide fragments; 

(v) analyzing amino acid sequences of the peptide fragments obtained in (iv); and 

(vi) comparing the amino add sequences obtained in (v) with the amino acid sequence represented by SEQ 
ID NOS:3502 to 7001 to identifying the protein having the amino acid sequences. 

66. The method according to claim 65, wherein the corynefonn bacterium is a microorganism belonging to the genus 
corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

67. The method according to claim 66, wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium gtutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetogiutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacteri- 
um melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 



68. A biologically pure culture of Corynebacterium glutamicum AHP-3 (FERM BP-7382) . 
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